207 Human Secreted Proteins
Field of the Invention 

This invention relates to newly identified polynucleotides and the polypeptides 
encoded by these polynucleotides, uses of such polynucleotides and polypeptides, and 
their production. 

Background of the Invention 

Unlike bacteria, which exist as a single compartment surrounded by a
membrane, human cells and other eucaryotes are subdivided by membranes into many
functionally distinct compartments. Each membrane-bounded compartment, or
organelle, contains different proteins essential for the function of the organelle. The cell
uses "sorting signals," which are amino acid motifs located within the protein, to target
proteins to particular cellular organelles.

One type of sorting signal, called a signal sequence, a signal peptide, or a leader 
sequence, directs a class of proteins to an organelle called the endoplasmic reticulum 
(ER). The ER separates the membrane-bounded proteins from all other types of 
proteins. Once localized to the ER, both groups of proteins can be further directed to
another organelle called the Golgi apparatus. Here, the Golgi distributes the proteins to
vesicles, including secretory vesicles, the cell membrane, lysosomes, and the other 
organelles. 

Proteins targeted to the ER by a signal sequence can be released into the
extracellular space as a secreted protein. For example, vesicles containing secreted 
proteins can fuse with the cell membrane and release their contents into the extracellular 
space - a process called exocytosis. Exocytosis can occur constitutively or after receipt 
of a triggering signal. In the latter case, the proteins are stored in secretory vesicles (or 
secretory granules) until exocytosis is triggered. Similarly, proteins residing on the cell 
membrane can also be secreted into the extracellular space by proteolytic cleavage of a 
"linker" holding the protein to the membrane. 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins include 
the commercially valuable human insulin, interferon, Factor VIII, human growth
hormone, tissue plasminogen activator, and erythropoietin. Thus, in light of the
pervasive role of secreted proteins in human physiology, a need exists for identifying
and characterizing novel human secreted proteins and the genes that encode them. This
knowledge will allow one to detect, to treat, and to prevent medical disorders by using
secreted proteins or the genes that encode them.

Summary of the Invention

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, antibodies, 
and recombinant methods for producing the polypeptides and polynucleotides. Also
provided are diagnostic methods for detecting disorders related to the polypeptides, and 
therapeutic methods for treating such disorders. The invention further relates to 
screening methods for identifying binding partners of the polypeptides. 

Detailed Description

Definitions

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original
environment (e.g., the natural environment if it is naturally occurring), and thus is
altered "by the hand of man" from its natural state. For example, an isolated
polynucleotide could be part of a vector or a composition of matter, or could be
contained within a cell, and still be "isolated" because that vector, composition of
matter, or particular cell is not the original environment of the polynucleotide.

In the present invention, a "secreted" protein refers to those proteins capable of
being directed to the ER, secretory vesicles, or the extracellular space as a result of a
signal sequence, as well as those proteins released into the extracellular space without
necessarily containing a signal sequence. If the secreted protein is released into the
extracellular space, the secreted protein can undergo extracellular processing to produce
a "mature" protein. Release into the extracellular space can occur by many
mechanisms, including exocytosis and proteolytic cleavage.

As used herein, a "polynucleotide" refers to a molecule having a nucleic acid
sequence contained in SEQ ID NO:X or the cDNA contained within the clone deposited
with the ATCC. For example, the polynucleotide can contain the nucleotide sequence
of the full length cDNA sequence, including the 5' and 3' untranslated sequences, the
coding region, with or without the signal sequence, the secreted protein coding region,
as well as fragments, epitopes, domains, and variants of the nucleic acid sequence.
Moreover, as used herein, a "polypeptide" refers to a molecule having the translated
amino acid sequence generated from the polynucleotide as broadly defined.

In the present invention, the full length sequence identified as SEQ ID NO:X
was often generated by overlapping sequences contained in multiple clones (contig
analysis). A representative clone containing all or most of the sequence for SEQ ID
NO:X was deposited with the American Type Culture Collection ("ATCC"). As
shown in Table 1, each clone is identified by a cDNA Clone ID (Identifier) and the
ATCC Deposit Number. The ATCC is located at 10801 University Boulevard,
Manassas, Virginia 20110-2209, USA. The ATCC deposit was made pursuant to the
terms of the Budapest Treaty on the international recognition of the deposit of
microorganisms for purposes of patent procedure.

A "polynucleotide" of the present invention also includes those polynucleotides
capable of hybridizing, under stringent hybridization conditions, to sequences contained
in SEQ ID NO:X, the complement thereof, or the cDNA within the clone deposited with
the ATCC. "Stringent hybridization conditions" refers to an overnight incubation at
42°C in a solution comprising 50% formamide, 5x SSC (750 mM NaCl, 75 mM sodium
citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran
sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA, followed by washing the
filters in 0.1x SSC at about 65°C.
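
The salt concentrations quoted above follow from simple scaling. The sketch below is illustrative only: it assumes the 1x SSC composition implied by the text (5x SSC = 750 mM NaCl and 75 mM sodium citrate, hence 1x = 150 mM NaCl and 15 mM sodium citrate), and the function name `ssc_concentrations` is hypothetical rather than part of any described method.

```python
# 1x SSC composition implied by the text: 5x SSC = 750 mM NaCl, 75 mM sodium citrate.
SSC_1X_MM = {"NaCl": 750 / 5, "sodium citrate": 75 / 5}


def ssc_concentrations(fold):
    """Return the millimolar composition of SSC at the given fold concentration."""
    # Rounding suppresses floating-point noise such as 15.000000000000002.
    return {solute: round(mm * fold, 6) for solute, mm in SSC_1X_MM.items()}


hybridization = ssc_concentrations(5)   # SSC strength in the hybridization solution
wash = ssc_concentrations(0.1)          # SSC strength in the stringent wash
print("5x SSC (mM):", hybridization)
print("0.1x SSC (mM):", wash)
```

The fifty-fold drop in salt between the hybridization solution and the wash, together with the higher wash temperature, is what makes the wash step stringent.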

Also contemplated are nucleic acid molecules that hybridize to the
polynucleotides of the present invention at lower stringency hybridization conditions.
Changes in the stringency of hybridization and signal detection are primarily
accomplished through the manipulation of formamide concentration (lower percentages
of formamide result in lowered stringency), salt conditions, or temperature. For
example, lower stringency conditions include an overnight incubation at 37°C in a
solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M NaH2PO4; 0.02M EDTA,
pH 7.4), 0.5% SDS, 30% formamide, 100 µg/ml salmon sperm blocking DNA;
followed by washes at 50°C with 1X SSPE, 0.1% SDS. In addition, to achieve even
lower stringency, washes performed following stringent hybridization can be done at
higher salt concentrations (e.g., 5X SSC).
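
As a worked example of the lower stringency buffer, the following sketch applies the usual dilution relation (C1 x V1 = C2 x V2) to the 20X SSPE stock composition given above. The function name `dilute_sspe` and the 100 ml final volume are assumptions chosen for illustration, not values taken from the specification.

```python
# 20X SSPE stock composition as stated in the text (molar).
SSPE_20X_M = {"NaCl": 3.0, "NaH2PO4": 0.2, "EDTA": 0.02}


def dilute_sspe(target_fold, final_volume_ml):
    """Return (ml of 20X stock needed, molar composition at the target fold)."""
    fraction = target_fold / 20          # share of the final volume that is stock
    stock_ml = final_volume_ml * fraction
    molar = {solute: round(m * fraction, 6) for solute, m in SSPE_20X_M.items()}
    return stock_ml, molar


stock_ml, molar = dilute_sspe(6, 100)    # 6X SSPE hybridization solution
print(stock_ml, molar)
wash_ml, wash_molar = dilute_sspe(1, 100)  # 1X SSPE wash
print(wash_ml, wash_molar)
```

Under these assumptions, 100 ml of 6X SSPE takes 30 ml of 20X stock and contains 0.9 M NaCl, whereas the 1X wash contains 0.15 M NaCl.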

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and
commercially available proprietary formulations. The inclusion of specific blocking
reagents may require modification of the hybridization conditions described above, due 
to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such
as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a
complementary stretch of T (or U) residues, would not be included in the definition of
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any
double-stranded cDNA clone).

The polynucleotide of the present invention can be composed of any
polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or DNA
or modified RNA or DNA. For example, polynucleotides can be composed of single-
and double-stranded DNA, DNA that is a mixture of single- and double-stranded
regions, single- and double-stranded RNA, and RNA that is a mixture of single- and
double-stranded regions, hybrid molecules comprising DNA and RNA that may be
single-stranded or, more typically, double-stranded or a mixture of single- and
double-stranded regions. In addition, the polynucleotide can be composed of triple-stranded
regions comprising RNA or DNA or both RNA and DNA. A polynucleotide may also
contain one or more modified bases or DNA or RNA backbones modified for stability
or for other reasons. "Modified" bases include, for example, tritylated bases and
unusual bases such as inosine. A variety of modifications can be made to DNA and
RNA; thus, "polynucleotide" embraces chemically, enzymatically, or metabolically
modified forms.

The polypeptide of the present invention can be composed of amino acids joined
to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and
may contain amino acids other than the 20 gene-encoded amino acids. The
polypeptides may be modified by either natural processes, such as posttranslational
processing, or by chemical modification techniques which are well known in the art.
Such modifications are well described in basic texts and in more detailed monographs,
as well as in a voluminous research literature. Modifications can occur anywhere in a
polypeptide, including the peptide backbone, the amino acid side-chains and the amino
or carboxyl termini. It will be appreciated that the same type of modification may be
present in the same or varying degrees at several sites in a given polypeptide. Also, a
given polypeptide may contain many types of modifications. Polypeptides may be
branched, for example, as a result of ubiquitination, and they may be cyclic, with or
without branching. Cyclic, branched, and branched cyclic polypeptides may result
from posttranslation natural processes or may be made by synthetic methods.
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative,
covalent attachment of phosphatidylinositol, cross-linking, cyclization, disulfide bond
formation, demethylation, formation of covalent cross-links, formation of cysteine,
formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI
anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation,
pegylation, proteolytic processing, phosphorylation, prenylation, racemization,
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins
such as arginylation, and ubiquitination. (See, for instance, PROTEINS -
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W.
H. Freeman and Company, New York (1993); POSTTRANSLATIONAL
COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic
Press, New York, pgs. 1-12 (1983); Seifter et al., Meth Enzymol 182:626-646 (1990);
Rattan et al., Ann NY Acad Sci 663:48-62 (1992).)

"SEQ ID NO:X" refers to a polynucleotide sequence while "SEQ ID NO: Y" 
refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table 1.

"A polypeptide having biological activity" refers to polypeptides exhibiting 
activity similar, but not necessarily identical to, an activity of a polypeptide of the
present invention, including mature forms, as measured in a particular biological assay,
with or without dose dependency. In the case where dose dependency does exist, it
need not be identical to that of the polypeptide, but rather substantially similar to the
dose-dependence in a given activity as compared to the polypeptide of the present
invention (i.e., the candidate polypeptide will exhibit greater activity or not more than
about 25-fold less and, preferably, not more than about tenfold less activity, and most
preferably, not more than about three-fold less activity relative to the polypeptide of the
present invention).
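
The fold-activity thresholds in this definition can be illustrated with a short sketch. It assumes that both activities are positive numbers measured in the same assay, and it treats the "about" 25-fold, tenfold, and three-fold limits as exact cut-offs purely for illustration. The function name `activity_tier` and the tier labels are hypothetical.

```python
def activity_tier(candidate_activity, reference_activity):
    """Classify a candidate polypeptide by how many fold less active it is
    than the reference polypeptide (illustrative exact cut-offs)."""
    if candidate_activity <= 0 or reference_activity <= 0:
        raise ValueError("activities must be positive")
    # Values below 1 mean the candidate is more active than the reference.
    fold_less = reference_activity / candidate_activity
    if fold_less <= 3:
        return "most preferred"
    if fold_less <= 10:
        return "preferred"
    if fold_less <= 25:
        return "within definition"
    return "outside definition"


for candidate in (200, 20, 5, 1):
    print(candidate, activity_tier(candidate, 100))
```

A candidate with greater activity than the reference falls in the first tier, consistent with the statement that the candidate "will exhibit greater activity or not more than about 25-fold less" activity.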

Polynucleotides and Polypeptides of the Invention

FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

This gene is expressed primarily in melanocytes and, to a lesser extent, in 
testes, ovary, kidney and other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancer, disorders of neural crest derived cells including pigmentation
defects, melanoma, reproductive organ defects, and defects of the kidney. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the skin,
reproductive, and renal systems, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

WO 98/54963 PCT/US98/11422

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating disorders that arise from alterations in 
the number or fate of neural crest derived cells including cancers such as melanoma and 
defects of the developing reproductive system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

This gene is expressed primarily in infant brain and fetal lung.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, developmental disorders of the brain or lung. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the central nervous and pulmonary systems,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treating or diagnosing disorders associated
with abnormal proliferation of cells in the central nervous system and developing lung.

FEATURES OF PROTEIN ENCODED BY GENE NO: 3 

This gene is expressed primarily in breast lymph node and to a lesser extent in 
ovarian cancer and chondrosarcoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune responses such as inflammation or immune surveillance for
tumors. This gene may be important for inflammatory responses associated with
tumors. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 236 as residues: Lys-45 to Val-50, Lys-69 to Arg-76.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment or diagnosis of immune responses
including those associated with tumor-induced inflammation.

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 



This gene is expressed primarily in T-cells and T-cell lymphomas.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immunological diseases involving T-cells such as inflammation,
autoimmunity, and cancers including T-cell lymphomas. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of T-cells and other cells of the immune
system, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosing and treating T-cell based disorders
such as inflammatory diseases, autoimmune disease and tumors including T-cell
lymphomas.

FEATURES OF PROTEIN ENCODED BY GENE NO: 5

This gene is expressed primarily in activated monocytes.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, inflammation, autoimmunity, infection, or disorders involving activation
of monocytes. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 238 as residues: Asp-19 to Arg-31.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosing or treating diseases that result in
activation of monocytes including infections, inflammatory responses or autoimmune
diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 6 

The translation product of this gene shares sequence homology with terminal
deoxynucleotidyltransferase which is thought to be important in catalyzing the
elongation of oligo- or polydeoxynucleotide chains.

This gene is expressed primarily in activated human neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, cancer, particularly those of the blood such as leukemia and deficiencies
in neutrophils such as neutropenia. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the cardiovascular system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to terminal deoxynucleotidyltransferase
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
treatment and differential diagnosis of acute leukemias. Alternatively, this gene may
function in the proliferation of neutrophils and be useful as a treatment for neutropenia,
for example, following neutropenia as a result of chemotherapy.

FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

The contig exhibits a reasonable homology to the human chorionic gonadotropin
(HCG) analogue-GT beta-subunit as disclosed in U.S. Patent No. 5,508,261 and PCT
Publication No. WO 92/22568. There is a high degree of conservation of the
structurally important cysteine residues in these identities.

This gene is expressed primarily in IL-1- and LPS-induced neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases of the immune system, including inflammatory diseases and
allergies. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of diseases of the immune
system since expression is primarily in neutrophils, and may be useful as a growth
factor for the differentiation or proliferation of neutrophils for the treatment of
neutropenia following chemotherapy.

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

This gene is expressed primarily in IL-1- and LPS-induced neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases of the immune system, including inflammatory diseases and
allergies. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 241 as residues: Ser-14 to Pro-22, Leu-43 to Val-53.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of diseases of the
immune system since expression is primarily in neutrophils, and may be useful as a
growth factor for the differentiation or proliferation of neutrophils for the treatment of
neutropenia following chemotherapy.

FEATURES OF PROTEIN ENCODED BY GENE NO: 9 

This gene is expressed primarily in IL-1- and LPS-induced neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases of the immune system, including inflammatory diseases and
allergies. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 242 as residues: Tyr-22 to His-35.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of diseases of the immune
system since expression is primarily in neutrophils, and may be useful as a growth
factor for the differentiation or proliferation of neutrophils for the treatment of
neutropenia following chemotherapy.



FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

This gene is expressed primarily in activated T-cells and to a lesser extent in
endothelial cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune dysfunctions including cancer of the T lymphocytes and
autoimmune disorders and inflammation. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of immune disorders
particularly of T-cell origin and may act as a growth factor for particular subsets of
T-cells such as CD4 positive cells which would make this a useful therapeutic for the
treatment of HIV and other immune compromising illnesses.

FEATURES OF PROTEIN ENCODED BY GENE NO: 11

This gene is expressed primarily in fetal tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of many developmental abnormalities. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the developing fetus,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful as a growth factor or differentiation factor for
particular cell types in the developing fetus and may be useful in replacement or other
types of therapy in cases where the gene is expressed aberrantly.

FEATURES OF PROTEIN ENCODED BY GENE NO: 12 

This gene is expressed primarily in T-cells and to a lesser extent in tumor tissue 

10 .including glioblastoma, meningioma, and Wilm's tumor. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases of the immune system including autoimmune conditions such as
rheumatoid arthritis, inflammatory disorders and cancer. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 245 as residues:
Thr-9 to Ser-14.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis/modulation of immune function
disorders, including rheumatoid arthritis and inflammatory responses.

FEATURES OF PROTEIN ENCODED BY GENE NO: 13

This gene is expressed primarily in placenta and to a lesser extent in fetal liver
and bone marrow.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of hematological disorders. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the hematological and immune
systems, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful as a growth factor for hematopoietic stem cells or
progenitor cells in the treatment of chemotherapy patients or kidney disease.

FEATURES OF PROTEIN ENCODED BY GENE NO: 14 

This gene is expressed primarily in stromal cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of hematopoietic disorders including cancer,
neutropenia, anemia, and thrombocytopenia. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the hematopoietic and immune systems, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful as a growth factor for hematopoietic stem cells or
progenitor cells, in particular following chemotherapy treatment. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 15 

The translation product of this gene shares sequence homology with epsilon-COP
from Bos taurus, which is thought to be important as a component of coatomer, a
complex of seven proteins that is the major component of the non-clathrin membrane
coat. Preferred polypeptides encoded by this gene comprise the following amino acid
sequences:
MAPPAPGPASGGSGEVDELFDVKNAFYIGSYQ 




VFLYRAYLAQRIGFGVVL^ 
MSRSXDVTNTTFLLNLAASI^ 
RLDLxAJRXELKR^MQDLDEDATLTQLATAWVSLA 
PTLLLLNGQAACHMAQGRWEAAEG^ 
PEVTNR^XSQLKDAHRSHPFIKEYQAKENDFDRLVLQYAPSAEAGPELSGP

(SEQ ID NO:458); or RDVERDVFLYPL^YLAQRKFGWLDEIKPSS APELQAVRMF 

ADYLAFESRRDSrVAELDREMSRSXDVTO 

QGDSLECTANrWQIIXKLD^ 

GEKLQDAYYIFQEMADKCSPTLLIJ^ 

SGYPETLVNLIVLSQHLGKPPEVTNR^XSQLKDAHRSHPFIKEYQAKENDFDRL
VLQYAPSA (SEQ ID NO:459). 

This gene is expressed primarily in activated monocytes and T-cells, and to a
lesser extent in multiple other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immunomodulatory disorders, specifically those relating to transport problems in these
cells. Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune system, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution and homology to epsilon-COP indicate that
polynucleotides and polypeptides corresponding to this gene are useful for
treating/diagnosing problems with the cellular transport of proteins that may result in
immunologic dysfunction.

FEATURES OF PROTEIN ENCODED BY GENE NO: 16 

The translation product of this gene shares sequence homology with an RNA
helicase which is thought to be important in polynucleotide metabolism. The translation
product of this contig exhibits good homology to the LbeIF4A antigen of Leishmania
braziliensis. The LbeIF4A antigen, or immunogenic portions of it, can be used to
induce protective immunity against leishmaniasis, specifically L. donovani, L. chagasi,
L. infantum, L. major, L. braziliensis, L. panamensis, L. tropica and L. guyanensis. It
can also be used diagnostically to detect Leishmania infection or to stimulate a cellular
and/or humoral immune response or to stimulate the production of interleukin-12.

This gene is expressed primarily in colon cancer and to a lesser extent in
pituitary.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of cancers, particularly of the colon. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the gastrointestinal
system, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 249 as residues: Glu-93 to Ala-98, Gln-150 to Leu-156,
Leu-220 to Leu-231, Leu-268 to Arg-273, Val-324 to Pro-341, Arg-372 to Asn-380,
Ser-405 to Gly-410, Phe-426 to Ala-433, Glu-458 to Asp-470, Arg-506 to Ser-547.
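
Residue ranges written in the form used above (for example "Glu-93 to Ala-98") can be read mechanically and mapped onto a one-letter amino acid sequence. The following is a minimal sketch under stated assumptions: the helper names `parse_epitope_ranges` and `extract_epitope` are hypothetical, residue numbering is assumed to start at 1 at the first residue of the sequence, and the toy sequence in the usage lines is invented for illustration and is not any SEQ ID NO of this specification.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Matches ranges such as "Glu-93 to Ala-98".
RANGE_RE = re.compile(r"([A-Z][a-z]{2})-(\d+)\s+to\s+([A-Z][a-z]{2})-(\d+)")


def parse_epitope_ranges(text):
    """Return a list of (start_residue, start, end_residue, end) tuples."""
    return [(a, int(i), b, int(j)) for a, i, b, j in RANGE_RE.findall(text)]


def extract_epitope(sequence, start_res, start, end_res, end):
    """Slice a 1-based, inclusive residue range out of a one-letter sequence.

    Raises ValueError if the range falls outside the sequence or if the
    named boundary residues do not match the sequence, which usually
    signals a numbering or transcription error.
    """
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("residue range lies outside the sequence")
    peptide = sequence[start - 1:end]
    if THREE_TO_ONE.get(start_res) != peptide[0]:
        raise ValueError(f"residue {start} is not {start_res}")
    if THREE_TO_ONE.get(end_res) != peptide[-1]:
        raise ValueError(f"residue {end} is not {end_res}")
    return peptide


if __name__ == "__main__":
    ranges = parse_epitope_ranges("Glu-93 to Ala-98, Gln-150 to Leu-156")
    print(ranges)
    toy_sequence = "MTEYKA"  # invented sequence, for illustration only
    print(extract_epitope(toy_sequence, "Thr", 2, "Lys", 5))
```

Checking the boundary residues against the sequence is a cheap way to catch ranges that were mis-transcribed or numbered against a different reading frame.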

The tissue distribution and homology to RNA helicase indicate that
polynucleotides and polypeptides corresponding to this gene are useful for development
of diagnostic tests for colon cancer.


FEATURES OF PROTEIN ENCODED BY GENE NO: 17 

The translation product of this contig has sequence homology to a cytoplasmic
protein that binds specifically to JNK, designated the JNK interacting protein-1 or JIP-1
in mice. JIP-1 caused cytoplasmic retention of JNK and inhibition of JNK-regulated
gene expression.

This gene is expressed primarily in brain, including pituitary, cerebellum, frontal
cortex, and fetal brain, and to a lesser extent in the kidney cortex.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of central nervous system disorders including
ischemia, epilepsy, Parkinson's disease, and schizophrenia. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the central nervous system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Furthermore, the translation product of this contig may suppress the effects of
the JNK signaling pathway on cellular proliferation, including transformation by the
Bcr-Abl oncogene. Preferred epitopes include those comprising a sequence shown in
SEQ ID NO: 250 as residues: Pro-6 to Ser-26, Ala-30 to Asp-41, Gly-55 to Ser-61,
Gly-74 to Thr-80, Tyr-117 to Ala-123, Tyr-167 to Asp-172, Ala-212 to Cys-223,
Pro-239 to Tyr-244.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for enhanced survival and/or differentiation of
neurons as a treatment for neurodegenerative disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 18

The translation product of this gene shares sequence homology with a liver
stage antigen from a protozoan parasite.

This gene is expressed primarily in fetal tissue and to a lesser extent in activated 
T-cells and other immune cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, developmental abnormalities and diseases of immune function. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to a protozoan antigen indicate that
polynucleotides and polypeptides corresponding to this gene are useful for
treatment/immune modulation of parasitic infections.

FEATURES OF PROTEIN ENCODED BY GENE NO: 19 

Preferred polypeptides encoded by this gene comprise the following polypeptide
sequences:

MK^GIEPSLATYHHnRLFDQPGDPLKRSSniYDLMNELMGKRFSPKD 

PDDDK^QSANISICSSLRDLELAYQVHGLLKTGDNWKFIGPDQHRNFYYSKFF 

DUCLMEQmVTLKWYED 

(SEQ ID NO:460); and/or KDSKEYGHTFRSDLREEILMLMARDKHPPELQVAF
ADCAADIKSAYESQPIRQTAQDWPATSLN 

NKIPRSELLNELMDSAKVSNSPSQAIEVVELASAFSLPICEGLTQRVMSDFAINQ 
EQKEALSNLTALTSDSDTDSSSDSDSDTSEGK (SEQ ID NO:461). Polynucleotides 
encoding such polypeptides are also provided. 

This gene is expressed primarily in stromal and CD34 depleted bone marrow 
cells and to a lesser extent in tissues of embryonic origin.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases of hematologic origin including cancers and immune 
dysfunction. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the hematopoietic and immune systems, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 252 as residues: Ser-28 to Gln-34. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful as a growth factor for hematopoietic stem cells or
progenitor cells which may be useful in the treatment of chemotherapy patients
suffering from neutropenia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 20

Preferred polypeptide fragments can be found in an alternative open reading 
frame. These preferred polypeptides comprise the amino acid sequence: 
MSSDNESDIEDEDLKLELRRLRDm.mQDLQSRQKHEESLYTKLGKVPPAVI 
IPPAAPLSGRRRRPTKSKGSKSSRSSSLGNKSPQLSGNLSGQSAASVLHPQQTL
HPPGNIPESGQNQLLQPLKPSPSSDNLYSAFTSDGAISVPSLSAPGQGTSSTNTV 
GATVNSQAAQAQPPAMTSSRKGTFTDDLH 

MNYEGPGMARKFSAPGQLCISMTSNLGGSAP1SAASATSLGHFTKSMCPPQQY 
GFPATPFGAQWSGTGGPAPQPLGQFQPVGTASLQNFNISNLQKSISNPPGSNL 

RTT (SEQ ID NO:462); IQDLQSRQKHEIESL\TKLGKVPPAV1IPPAAPLSGRRRR
PTKSKGSKSSRSSSLGNKSPQLSGNLSGQSAASVLHPQQTLHPPGNIPESGQN 
QLLQPLKPSPSSDNLYSAFTSDGAISVPSLSAPGQGTSST (SEQ ID NO:463); 
TSDGAISVPSLSAPGQGTSSTNTVGATVNSQAAQAQPP.AAITSSRKGTFTDDLH 
(SEQ ID NO:464); KGHMNYEGPGMARKFSAPGQLaSMTSNLGGSAPISAAS 

ATSLGHFTK (SEQ ID NO:465); QPLKPSPSSDNLYSAFTSDGAISVPSLSAPG
(SEQ ID NO:466). Also preferred are polynucleotide fragments encoding these
polypeptide fragments. 

This gene is expressed in fetal liver and tissues associated with the CNS.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, liver and CNS diseases. Similarly, polypeptides and antibodies directed
to these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the liver and CNS, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 253 as residues:
Gln-26 to Lys-34.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of liver diseases such
as hepatocellular carcinomas and diseases of the CNS.

FEATURES OF PROTEIN ENCODED BY GENE NO: 21

In an alternative reading frame, this gene shows sequence homology to two
recently cloned genes, karyopherin beta 3 and Ran_GTP binding protein 5. (See
Accession Nos: gi|2102696 and gnl|PID|e328731.) The Ran_GTP binding protein is
related to importin-beta, the key mediator of nuclear localization signal (NLS)-dependent
nuclear transport. Based on homology, it is likely that this gene may have activity
similar to the Ran_GTP binding protein. Preferred polypeptide fragments comprise the
amino acid sequence: VRVAAAESMXLIJ^CAXVRGPEY^
JGTEPDSDVLSEIMHSFAK (SEQ ID NO:467). Also preferred are polynucleotide
fragments encoding these polypeptide fragments.

This gene is expressed in thymus tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of immune disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 22 

This gene is expressed primarily in prostate and osteoclastoma tissues. 

Preferred polypeptide fragments also comprise the amino acid sequence:
MEINNQNCnyiDLVRTVNIENGVEGLLIFG.^PESWLIGVRCSSEPPIL\LLL^
AHSQKRRLDGWSFIRHLRVHYCVSLTIHFS (SEQ ID NO:468). Also preferred are
polynucleotide sequences encoding this polypeptide fragment.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, bone and prostate diseases, and cancers, particularly of the bone and
prostate. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the bone and prostate systems, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 255 as residues: Met-1 to Ser-11.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of bone and prostate
disorders, especially cancers of those systems.

FEATURES OF PROTEIN ENCODED BY GENE NO: 23 

This gene shares sequence homology with the FK506-binding protein (FKBP-13)
family, a known cytosolic receptor for the immunosuppressants. Recently, another
group has cloned a very similar gene, recognizing the homology to the FK506-binding
protein family, calling their gene FKBP23. (See Accession No. 2827255.)

This gene is expressed primarily in lymphoid tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample, especially for those susceptible to immune suppressant therapies, and
for diagnosis of diseases and conditions, which include, but are not limited to, immune
suppressant disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 256 as residues: Ala-19 to Val-31,
Arg-38 to Gly-49, Ala-61 to Lys-66, Tyr-68 to Pro-78, Gly-116 to Ala-121, Asp-154 to
Ser-162, Glu-173 to Gln-186, Phe-194 to Gly-203, Pro-207 to Val-212.

The tissue distribution and homology to FKBP-12 and -13 indicate that
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis
and treatment of immune suppressant disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 24 

This gene is expressed primarily in the brain and in the retina. This gene maps 
to chromosome 8, and therefore can be used in linkage analysis as a marker for 
chromosome 8. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurological and ocular associated disease states. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the central
nervous system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 257 as residues: Cys-34 to Asp-40.

The tissue distribution in retina indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and/or detection of eye disorders 
including blindness, color blindness, impaired vision, short and long sightedness, 
retinitis pigmentosa, retinitis proliferans, and retinoblastoma. Expression in the brain
indicates that the gene is useful for the detection/treatment of neurodegenerative disease
states and behavioral disorders such as Alzheimer's Disease, Parkinson's Disease,
Huntington's Disease, schizophrenia, mania, dementia, paranoia, obsessive-compulsive 
disorder and panic disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 25 

This gene shows sequence homology to a newly identified class of proteins
expressed in the nervous system, called the stathmin family. (See Accession No. 2585991;
see also Eur. J. Biochem. 248 (3), 794-806 (1997).) The stathmin family appears to be
a ubiquitous phosphoprotein family involved as a relay integrating various intracellular
signaling pathways. These pathways affect cell proliferation and differentiation.
Preferred polypeptide fragments comprise the amino acid sequence:
QDKHAEEVRKNKBLKEEASR (SEQ ID NO:469); QQDLSPWAAPVGCPLXXASX
TCHXLPLSGCLRRQSXSLPVVAXLCFWFSCPLASLFVPGQPCVTCPFPSLPFQD
KHAEEVRKNKELKEEASR (SEQ ID NO:470). Also preferred are the
polynucleotide fragments encoding these polypeptide fragments.

This gene is expressed primarily in brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurological disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the central nervous system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the detection/treatment of neurodegenerative
disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's
Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia,
obsessive-compulsive disorder and panic disorder.

FEATURES OF PROTEIN ENCODED BY GENE NO: 26

The polynucleotide sequence of this gene contains a domain similar to a Flt3 
ligand peptide. Preferred polypeptide fragments comprise the amino acid sequence: 
PTRCCTTQPCRSSARRPCWVPMVPSPEGREXQPTCPS (SEQ ID NO:471). Thus, 
this gene may have activity as binding to FIG receptors, a process known to promote 
angiogenesis and/or lymphangiogenesis.

This gene is expressed in human tonsil, and to a lesser extent in
teratocarcinoma, placenta, colon carcinoma, and fetal kidney.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for identification of the tissue(s) or cell type(s) present in a biological sample
and for diagnosis of diseases and conditions, which include, but are not limited to,
diseases of the tonsil, as well as cancers, such as colon, reproductive, and kidney
cancers. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
tonsils, colon, reproductive organs, and kidneys, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 259 as residues:
Pro-22 to Glu-33.

The tissue distribution in tonsil and several cancers and fetal tissues indicates
that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of diseases of the tonsil or colon, such as tonsillitis, 
inflammatory diseases involving nose and paranasal sinuses, especially during the 
infection of influenza, adenoviruses, parainfluenza, rhinoviruses. The gene may also be 
useful in the diagnosis and treatment of neoplasms of nasopharynx or colon origins. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 27 

In an alternative reading frame exists a large open reading frame that encodes a 
preferred polypeptide. Preferred polypeptide fragments comprise the amino acid 
sequence: 

MKRSLNENSARSTAGCLPVPLFNQKKRNRQPLTSNPLKDDSGISTPSDNYDFP 

PLPTDVVAWEAVNPEX,\PVMKTVDTGQIPHSVSRPLRSQDSVFNSIQSNTGRSQ 

GGWSYRDGNKNTSLKTWXKNDFKPQCKRTNLV.ANDGKNSCPMSSGAQQQK 

QLRTPEPPNLSRNKETELLRQTHSSKISGCTMRGLDKNSALQTLKPNFQQNQY 

KXQMLDDIPEDNTTLKETSL^ 

FEVLAVLDSAVTPGPYYSKTFLM^ 

NYDQKKNffQCVSVRPASVSEQKTF^ (SEQ ID 

NO:472); SQDSVFNSIQSNTGRSQGGWSYRDGNKNTSLKTWXKNDFKPQCKR 

(SEQ ID NO:473); NKETELLRQTHSSKISGCTMRGLDKNS ALQTLKPNF (SEQ ID 

NO:474);SSLRHSAVIESMKY^T^HAQKTVLLFEVLAVLDSA 

(SEQ ID NO:475); and PRLIRGRVHRCVGNYDQKKNIFQCVS VRPASVSEQKT 

FQAFV (SEQ ID NO:476). 

This gene is expressed primarily in human testes. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, male reproductive disorders, including cancer. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the male reproductive system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful as a hormone with reproductive or other systemic
functions; for contraceptive development; and for treating male infertility of testicular
causes, such as Klinefelter's syndrome, varicocele, and orchitis; male sexual dysfunctions;
testicular neoplasms; and inflammatory disorders such as epididymitis.

FEATURES OF PROTEIN ENCODED BY GENE NO: 28 
This gene is expressed primarily in apoptotic T cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases relating to T cells, as well as cancer in general. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for immune disorders. Moreover, since the gene
was isolated from an apoptotic cell, and based on the understanding of the relationship
between apoptosis and cancer, it is likely that this gene plays a role in the genesis of
cancer.

FEATURES OF PROTEIN ENCODED BY GENE NO: 29
This gene is expressed primarily in human tonsils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, gastrointestinal disorders. Similarly, polypeptides and antibodies directed
to these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the gastrointestinal system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of gastrointestinal 
diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 30

The translation product of this gene shares sequence homology with the C44C1.2
gene product of Caenorhabditis elegans, which has unknown function. Preferred
polypeptide fragments comprise the amino acid sequence:

GVFRPCVCGRPASLTCSPLDPEVGPYCDTPTMRTLFNLLWLALACSPVHTTLSK
SDAKKAASKTLLEKSQFSDKPVQDR
FAGDVLGYVTPWSHGYDVTKVFGSKFTQISPVWLQLKRRGmVlFEVTGLHD
VDQGWMRAVRKHAKGLfflW KTVVQ V A
KNQHFDGFVVEV\VNQ
DQLGMFTHKEFEQLAPVLDGFSLV^
KXKWRTKSSWGSTSMXWTXRXPXDA^
PQ (SEQ ID NO:477); TCSPLDPEVGPYCDTPTMRTLFNLLWLALACSPVHTTLS
(SEQ ID NO:478); LVWDLK.^SWLEHRSYCSAKARDRHFAGDVLGYVTPW
NSHGYDVTKVFGSKF (SEQ ID NO:479); REMFEVTGLHDVDQGWMPL^VRK
HAKGLHr/PRLLFEDWTTODFRNVLDSEDE (SEQ ID NO:480); HFDGFVVEVW
NQLLSQKRVGLIHMLTHLAEALHQARLLALLVIPPAITPGTDQLGM (SEQ ID
NO:481); DGFSLMTYDYSTAHQPGPNAPLSWV^
GST (SEQ ID NO:482). Also preferred are polynucleotide fragments encoding these
polypeptide fragments. This gene maps to human chromosome 11, and therefore is
useful in linkage analysis as a marker for chromosome 11.

This gene is expressed primarily in human T cells and to a lesser extent in
human colon carcinoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune disorders and cancer. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the immune and gastrointestinal systems,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 263 as residues: Leu-21 to Ala-30, Ser-38 to Asp-47, Pro-87 to Asp-94, Leu-197
to Thr-204, Pro-256 to Ser-262, Thr-277 to Arg-282, Thr-293 to Trp-303.
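
Residue ranges of the form given above (three-letter residue name, position, "to", three-letter residue name, position) can be mapped back onto a polypeptide sequence mechanically. The following is a minimal sketch under stated assumptions: positions are taken to be 1-based and inclusive, the helper names `parse_epitope_ranges` and `extract_epitopes` are hypothetical, and the short sequence used in the example is a made-up placeholder, not SEQ ID NO: 263.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Matches text such as "Leu-21 to Ala-30".
RANGE_RE = re.compile(r"([A-Z][a-z]{2})-(\d+)\s+to\s+([A-Z][a-z]{2})-(\d+)")


def parse_epitope_ranges(text):
    """Return (start_residue, start, end_residue, end) tuples found in text."""
    return [
        (THREE_TO_ONE[a], int(i), THREE_TO_ONE[b], int(j))
        for a, i, b, j in RANGE_RE.findall(text)
    ]


def extract_epitopes(sequence, text):
    """Slice each listed range out of sequence, assuming 1-based inclusive positions.

    Raises ValueError if a range falls outside the sequence or if the
    residues at the range ends do not match the named residues.
    """
    epitopes = []
    for start_aa, start, end_aa, end in parse_epitope_ranges(text):
        if not 1 <= start <= end <= len(sequence):
            raise ValueError(f"range {start}-{end} is outside the sequence")
        peptide = sequence[start - 1:end]
        if peptide[0] != start_aa or peptide[-1] != end_aa:
            raise ValueError(f"terminal residues do not match for {start}-{end}")
        epitopes.append(peptide)
    return epitopes


# Placeholder sequence for illustration only.
toy_sequence = "MLKDEFGA"
print(extract_epitopes(toy_sequence, "Leu-2 to Ala-8, Lys-3 to Glu-5"))
```

Checking the terminal residues against the named residues is a cheap way to catch numbering or transcription errors in a listed range.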

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and treatment of immune
disorders and gastrointestinal diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 31 
The translation product of this gene shares sequence homology with ribosomal
protein L11 of Caenorhabditis elegans. (See Accession No. 156201.) Preferred
polypeptide fragments comprise the amino acid sequence:
ERGVSINQFCKEFNERTKDIKEGIPLPTKILVKP^
EKGARQTGKEVAGLVTLKHW
GIRVVKDLSSEELAAFQKERAIFLAAQKEADLAAQEEAAKK (SEQ ID NO:483).
Also preferred are polynucleotide fragments encoding these polypeptide fragments.

This gene is expressed in human embryo tissue and to a lesser extent in human
epithelioid sarcoma and other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for identification of the tissue(s) or cell type(s) present in a biological sample
and for diagnosis of diseases and conditions, which include, but are not limited to,
developmental disorders and epithelial cell cancer. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the embryonic and epithelial cell systems,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 264 as residues: Lys-34 to Gly-40.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of developmental 
disorders and epithelial cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 32 

This gene is expressed primarily in resting T cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, inflammatory and general immune disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and treatment of disorders of
the immune system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 33 

This gene is believed to reside on chromosome 1. Accordingly, polynucleotides
derived from this gene are useful in linkage analysis as chromosome 1 markers.

This gene is expressed primarily in prostate and to a lesser extent in Soares adult
brain, human umbilical vein endothelial cells, and amniotic cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, prostate-related disorders. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the urinary system and nervous system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that the protein products of this gene are useful
for the diagnosis and treatment of disorders of the urinary and nervous systems.

FEATURES OF PROTEIN ENCODED BY GENE NO: 34 
This gene shares sequence homology with the R05G6.4 gene product. (See Accession No.
gi|1326333.) This gene also shares sequence homology with the cyclophilin-like protein
CyP-60. (See Accession No. 1199598; see also Biochem. J. 314 (1), 313-319
(1996).) Preferred polypeptide fragments comprise the amino acid sequence:
AVYTYHEKKKDTAASGY
YEREAILEYILHQKKELARQM
SAIVSRPLNPFTAKALSGTSPDDVQPGPSVGPPSKDKDKVLPSFWIPSLTPEAK
ATKLEKPSRTVTCPMSGKPLRNISDLTPVHFTPLDSSVDRVGLITRSERYVCAVT
RDSLSNATPCAVLRPSGAVVTT^ECVEm
(SEQ ID NO:484); YLYEREAILEYILHQKKEIARQMKAYEKQRGTRREEQKELQ
RAASQDHVRGFLE (SEQ ID NO:485); and FTAKALSGTSPDDVQPGPSVGPP
SKDKDKVLPSFWIPSLTPEAKATKLEKPSRTVTCPMSGKPL (SEQ ID NO:486).
Also preferred are polynucleotide fragments that encode these polypeptide fragments.

This gene is expressed primarily in human testis and to a lesser extent in other
tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, male reproductive disorders and in particular testicular cancer. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of disorders of the
male reproductive system and in particular of testicular cancer.

FEATURES OF PROTEIN ENCODED BY GENE NO: 35 

The translation product of this gene shares sequence homology with Lpe5p of
Saccharomyces cerevisiae which is thought to be important in the metabolism of
phospholipids.

This gene is expressed primarily in liver and brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, metabolic disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the metabolic and nervous systems, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 268 as residues: Pro-14 to Leu-20, Lys-23 to Asn-38, Arg-109 to Arg-114,
Lys-119 to Asn-124, Glu-152 to Leu-157, Pro-172 to Val-180.

The tissue distribution and homology to Lpe5p of Saccharomyces cerevisiae
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
the diagnosis and treatment of metabolic and nervous disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 36
This gene shares sequence homology with the nuclear ribonucleoprotein U (HNRNP
U), encoded by C. elegans. (See Accession No. gi|1703576.) Preferred polypeptide
fragments comprise the amino acid sequence:

MDTSENRPENDWEPPMPIADQVSNDDRPEGSVEDEEKKESSLPKSFKRKISVV
SATKGVPAGNSDTEGGQPGRKRRWGASTATTQKKPSte
AGQEAVVDLHADDSRISEDETERNGDDGTHDKGLKICRTVTQVVP^GQENGQ
REEEEEEKEPEAEPPWPQVSVEV.M.PPPAEHEYKKVTLGDTLTRRSISQQKSGV
SmDDPVRTAQVPSPPRGKJSMVfflSNLVRPFTLGQLKELLGRTGTLVEEAFW
DKIKSHCFVTYSTVEEAVATRTALHGVKWPQSNPKFLCADYAEQDELDYHRGL
LVDRPSETKTEEQGIPRPLHPPPPPPVQPPQHPRAEQREQERAVREQWAERERE
MERRERTRSEREWDRDKVREGPRSRSRSRXRRRJ^
EPPAKLLDDLFRKTKAAPCIYWLPLTO
EQKEREKEAERERNRQLEREKRREHSRERDRERERERE
RGRERDRRDTKRHSRSRSRSTPVRDRGGR (SEQ ID NO:488). Also preferred are
the polynucleotide fragments encoding this polypeptide fragment.
This gene is expressed primarily in epididymis.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases of the male reproductive system. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the male reproductive system, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and treatment of male
reproductive disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 37
This gene is expressed primarily in amygdala.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, inflammatory diseases and reproductive disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the amygdala,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of inflammatory
diseases and reproductive disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 38 

This gene shares sequence homology with human opsonin protein P35 
fragment. (See Accession No. R9418L) The opsonin protein activates the phagocytosis 
of pathogenic microbes by phagocytic cells. Preferred polypeptide fragments comprise 
the amino acid sequence: GCDSCPPHLPREAFAQDTQAEGECSSRAERADMCPDAP 
PSQEVPEGPGAAP (SEQ ID NO:489). Also preferred are polynucleotide fragments 
encoding these polypeptide fragments. 

This gene is expressed in immune-related tissues such as thymus, macrophages, and
T cells, and to a lesser extent in many other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune disorders and infectious disease. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system and infectious disease,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 271 as residues: Lys-9 to Arg-14, MetoS to Asp-51. 

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of immune disorders,
as well as the treatment and/or diagnosis of infectious disease.

FEATURES OF PROTEIN ENCODED BY GENE NO: 39 

The translation product of this gene shares sequence homology with alpha-2
type I collagen which is thought to be important in tissue repair. (See, e.g.,. 21.1 607.)
Preferred polypeptide fragments comprise the amino acid sequence: PQLPSCGRPW
PGTASVFQSHTQGPREDPDPCRAQGSAGTHCPISLSPPRQ (SEQ ID NO:490).
Also preferred are the polynucleotide sequences encoding these polypeptide sequences.

This gene is expressed primarily in the brain and to a lesser extent in the kidney
and thymus.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, brain, kidney, and immune disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly brain, kidney, and immune disorders,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to alpha-2 type I collagen indicate that
polynucleotides and polypeptides corresponding to this gene are useful in tissue repair
and for diagnosis and treatment of brain, kidney, and immune disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 40 

The translation product of this gene shares sequence homology with mini-collagen,
which is thought to be important in tissue repair and tumor metastasis. (See
Accession No. gnl|PID|d1006976.) Preferred polypeptide fragments comprise the
amino acid sequence: PGFRGPSGSLGCSFFPRSLGRVLPPGCQRPGAHAD
SSPPPTP (SEQ ID NO:491). Also preferred are polynucleotides encoding this
polypeptide fragment.

This gene is expressed in ovarian cancer and to a lesser extent in dendritic cells
and smooth muscle.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, tumor metastasis and tissue repair. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly those involving tumor metastasis and tissue
repair, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene expression
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not
having the disorder. Preferred epitopes include those comprising a sequence shown in
SEQ ID NO: 273 as residues: Asn-2 to His-11.

The tissue distribution and homology to the mini-collagen gene indicate that
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis
and treatment of tumor metastasis and tissue repair.

FEATURES OF PROTEIN ENCODED BY GENE NO: 41 

This gene shares sequence homology with the HIV TAT protein. (See
Accession No. 328416.) Preferred polypeptide fragments comprise the amino acid
sequence: EDLKXPDPASLRAASCGEGKXEIKLACKNCTCGLAEELEKEK
SREQMSSQPKSACGNCYLGDAFRCASCPYLGMPAFKPGEKVLLS (SEQ ID
NO:492); EDLKKPDPASLRAASCGEGKKRKACKNCTCGLAEELEKEK
SREQMSSQPKSACGNCYLGDAFRCASCPYLGMPAFKPGEKVLLSDSNLHD
(SEQ ID NO:493); CGNCYLGDAFRCASCPYLGMPAFKPGEKVLLSDS
(SEQ ID NO:494); SCGEGKKRKACKNCTCGLAEELEKE (SEQ ID NO:495);
SQPKSACGNCYLGDAFRCASC (SEQ ID NO:496); and REAGQNSERQYVS
LSRD (SEQ ID NO:497). Also preferred are polynucleotide fragments encoding these
polypeptide fragments.

This gene is expressed primarily in the infant brain and to a lesser extent in the
breast and testes.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, brain, testes and breast disorders. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly brain, testes and breast disorders,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 274 as residues: Pro-7 to Val-15.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of brain, testes and
breast, and other related disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 42 

This gene is expressed primarily in the infant brain, human cerebellum, and to a
lesser extent in medulloblastoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, brain related disorders and medulloblastoma and other brain cancers.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly brain
related disorders and brain cancers, including medulloblastoma, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 275 as residues: Thr-41 to Glu-47.

FEATURES OF PROTEIN ENCODED BY GENE NO: 44

This gene is expressed primarily in the fetal brain and cerebellum and to a lesser
extent in the placenta.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neuronal cell related disorders. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly neuronal cell related disorders, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 277 as residues: Thr-20 to Gly-28.

The tissue distribution and homology to proline-rich protein genes indicate that
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis
and treatment of neuronal cell related disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 45 

The translation product of this gene shares sequence homology with human
precerebellin, which is thought to be important in synaptic physiology. (See
Accession No. gi|180251.) It has been observed that cerebellin-like immunoreactivity is
associated with Purkinje cell postsynaptic structures. Thus, it is likely that this gene
also has synaptic activity. Preferred polypeptide fragments comprise the amino acid
sequence: QEGSEPVLLEGECLVVCEPGRAAAGGPGGAALGEAPPGRVAFXAV
RSHHHEPAGETGNGTSGAIYFDQVLVNEGGGFDRASGSFVAPVRGVYSFRFH
WKVYNRQTVQVSLMLiN^
LRRGXSTGW (SEQ ID NO:499). Also preferred are polynucleotide fragments
encoding these polypeptide fragments.

This gene is expressed primarily in cerebellum and infant brain. By Northern
analysis, a single transcript of 2.4 kb was observed in brain tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of human brain related
disorders, brain cancers, and medulloblastoma.

FEATURES OF PROTEIN ENCODED BY GENE NO: 43

The translation product of this gene shares sequence homology with a
phosphotyrosine-independent ligand for the lck SH2 domain which is thought to be
important in signal transduction related to phosphotyrosine-independent ligand for the
lck SH2 domain. (See Accession No. gi|1184951.) Preferred polypeptide fragments
comprise the amino acid sequence: ESSGQARTLADPGPGWPRQQGMCFGSLT
GLSTTPHGFLTVSAEADPRLIESLSQMLSMGFSDEGGWLTRLLQTKNYDIGAAL
DTIQYSKH (SEQ ID NO:498). Also preferred are polynucleotide fragments encoding
this polypeptide fragment. It is likely that this gene is a new member of a family of
phosphotyrosine-independent ligands for the lck SH2 domains.

This gene is expressed primarily in the placenta and to a lesser extent in
endothelial cells and neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, reproductive, cardiovascular, immune, and infectious diseases.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
cardiovascular, reproductive, and immune systems, and infectious diseases, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.
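
The comparison described above, in which expression in a sample from an individual is judged "significantly higher or lower" than the standard level in healthy tissue or fluid, can be sketched in Python as follows. This is only an illustrative sketch: the function name, the two-fold threshold, and the example numbers are assumptions, since the specification states neither a numeric cutoff nor a particular assay.

```python
def classify_expression(sample_level, standard_level, fold_threshold=2.0):
    """Classify a sample's expression level against the standard (healthy) level.

    sample_level and standard_level are measurements in the same arbitrary
    units (for example, normalized band intensity). fold_threshold is a
    hypothetical cutoff chosen for illustration only.
    """
    if sample_level < 0 or standard_level <= 0:
        raise ValueError("levels must be non-negative and the standard must be positive")
    ratio = sample_level / standard_level
    if ratio >= fold_threshold:
        return "higher"
    if ratio <= 1.0 / fold_threshold:
        return "lower"
    return "unchanged"


# Hypothetical readings: standard level 4.0, three test samples.
for level in (10.0, 1.0, 5.0):
    print(level, classify_expression(level, 4.0))
```

In practice a statistical test over replicate measurements would normally replace the fixed fold cutoff assumed here.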

The tissue distribution and homology to a phosphotyrosine-independent ligand
for the lck SH2 domain indicate that polynucleotides and polypeptides corresponding
to this gene are useful for diagnosis and treatment of cardiovascular, reproductive, and
immune system diseases, as well as infectious diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 47

The translation product of this gene shares sequence homology with a 12 kD
nucleic acid binding protein of feline calicivirus, which is thought to be important in viral
replication. (See Accession No. 59264.)

This gene is expressed primarily in human cardiomyopathy and to a lesser
extent in T helper cells, fetal brain and synovial sarcoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cardiomyopathy as well as viral infection. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the cardiovascular system, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 280 as residues: Trp-20 to Cys-26.

The tissue distribution in cardiomyopathy and homology to viral 12 kD nucleic
acid binding protein indicate that polynucleotides and polypeptides corresponding to
this gene are useful for diagnosis and intervention of cardiomyopathy, including those
caused by ischemic, hypertensive, congenital, valvular, or pericardial abnormalities.
The gene expression pattern may be the consequence or the cause of these conditions.

FEATURES OF PROTEIN ENCODED BY GENE NO: 48 

The translation product of this gene shares sequence homology with a tumor
necrosis factor related gene product, which is thought to be important in tumor necrosis,
bacterial and viral infection, immune diseases and immunoreactions.

This gene is expressed primarily in colon and to a lesser extent in ovarian and 
breast cancers. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, tumors of colon, ovary or breast origins. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes

not limited to, neuronal cell signal transduction and synaptic physiology. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the neuronal cell
signal transduction and synaptic physiology, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to gene or gene family indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and treatment of neuronal cell related disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 46

This gene is expressed in fetal liver and spleen, and to a lesser extent in bone 
marrow, umbilical vein, and T cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, disorders of the immune system, particularly hematopoiesis. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the hematopoiesis
and immune disorders, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 279 as residues: Asp-30 to Glu-57.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of hematopoietic and
immune disorders.

for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the colon, ovary and breast, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to tumor necrosis factors indicate that
polynucleotides and polypeptides corresponding to this gene are useful for intervention
of cancers of colon, ovary and breast origins, because TNF family members are known
to be involved in tumor development.

FEATURES OF PROTEIN ENCODED BY GENE NO: 49 

The translation product of this gene shares sequence homology with mucins,
such as epithelial mucin, which are thought to be important in extracellular matrix
functions such as protection, lubrication and cell adhesion (See, for example, Accession
No. R68002). Preferred polypeptide fragments comprise the following amino acid
sequence: PRSRPALRPGRQRPPSHSATSGVLRPRKKPDP (SEQ ID NO: 500).
Also preferred are polynucleotide fragments encoding these polypeptide fragments.
Moreover, this gene maps to chromosome 22q11.2-qter, and therefore can be used as
a marker in linkage analysis for chromosome 22.

This gene is expressed primarily in corpus callosum.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, tumors, especially of corpus callosum, as well as metastatic lesions.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
corpus callosum and other solid tissues, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to mucins indicate that polynucleotides
and polypeptides corresponding to this gene are useful as serum tumor markers or
immunotherapy targets, because tumor cells have greatly elevated levels of mucin
expression and shed the molecules into the epithelial tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 50 

This gene is expressed primarily in CD34 depleted buffy coat cord blood and
primary dendritic cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, hematopoietic disorders and immunological disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the hematopoietic and
immune systems, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution in CD34 depleted buffy coat cord blood and primary 
dendritic cells indicates that polynucleotides and polypeptides corresponding to this 
gene are useful for diagnosis and treatment of hematopoietic and immune disorders. 

Secreted or cell surface proteins in the above tissue distribution often are involved in
cell activation (e.g., cytokines) or are molecules involved in cell surface activation.

FEATURES OF PROTEIN ENCODED BY GENE NO: 51 

The translation product of this gene shares sequence homology with Interferon
induced 1-8 gene encoded polypeptide, which is thought to be important in binding to
retroviral rev responsive element. Preferred polypeptide fragments comprise the
following amino acid sequence: MTLITPSXKLTFXKGNKSWSSRACSSTLVDP
(SEQ ID NO:501). Also preferred are polynucleotide fragments encoding these
polypeptide fragments.

This gene is expressed primarily in CD34 positive cells and neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, retroviral infection, such as AIDS, and other immune disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 284 as residues: Gln-51 to Trp-62.

The tissue distribution and homology to interferon induced gene 1-8 indicate
that polynucleotides and polypeptides corresponding to this gene are useful for
intervention of retroviral infection including HIV. The factor may be involved in viral
stability or viral entry into the cells. Alternatively, the virus/factor complex may elicit
the cellular immune reaction.

FEATURES OF PROTEIN ENCODED BY GENE NO: 52 

This gene shares sequence homology to immunoglobulin lambda chain (See
Accession No. 2865484). Therefore it is likely that this gene has activity similar to an
immunoglobulin lambda chain. Preferred polypeptide fragments comprise the following
amino acid sequence: GHPSP.ALSIAPSDGSQLPCDEVPYGEAHVTRYCKKPLTNS
HLETEAQSSSL (SEQ ID NO:502). Also preferred are polynucleotide fragments
encoding these polypeptide fragments.

This gene is expressed primarily in Hodgkin's lymphoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, Hodgkin's lymphoma and other immune disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 285 as residues: Pro-27 to Thr-32.

The tissue distribution in Hodgkin's lymphoma and the sequence homology
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis of Hodgkin's lymphoma, since the elevated expression and secretion by the
tumor mass may be indicative of tumors of this type. Additionally, the gene product may
be used as a target in the immunotherapy of the cancer. Because the gene is expressed in
cells of lymphoid origin, the natural gene product may be involved in immune
functions. Therefore it may also be used as an agent for immunological disorders
including arthritis, asthma, immune deficiency diseases such as AIDS, and leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 53 

This gene has extensive homology to cDNA for Homo sapiens mRNA for the
ISLR gene (See Accession No. AB003184). This protein is considered to be a new
member of the Ig superfamily and contains a leucine-rich repeat (LRR) with conserved
flanking sequences and a C2-type immunoglobulin (Ig)-like domain. These domains are
important for protein-protein interaction or cell adhesion, and therefore it is possible that
the novel protein ISLR may also interact with other proteins or cells. The ISLR gene
was mapped on human chromosome 15q23-q24 by fluorescence in situ hybridization
(See Medline Article No. 97468140). Homology to the ISLR gene has been confirmed
by another independent group as well (See Accession No. Hs.102171).

This gene is expressed in a number of tissues including human retina, heart,
skeletal muscle, prostate, ovary, small intestine, thyroid, adrenal cortex, testis,
stomach, spinal cord, fetal lung and fetal kidney tissues, colon, tonsil and stomach
cancer, and to a lesser extent in endometrial stromal cells treated with estradiol, breast
tissue, synovium, lymphoma, and a number of other tumors.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, tumors of colon, ovary and breast origins. However, due to the wide
range of expression in various tissues, the protein may play a vital role in the development
of cancer in other tissues as well, not just those mentioned above. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the colon, ovary and
breast, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Additionally, this gene maps to chromosome
15q23-q24, and therefore can be used as a marker in linkage analysis for chromosome 15.

The tissue distribution in tumors of colon, ovary, and breast origins indicates
that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis and intervention of these tumors, in addition to other tumors where
expression has been indicated. The protein, as well as antibodies directed against the
protein, may show utility as tumor markers and/or immunotherapy targets for the above
listed tumors and tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 54 

This gene has homology to multidrug resistance gene 1 (See Accession No.
P06795). Preferred polynucleotide fragments comprise the following sequence:
GCTTCGTGTCCAACCCTCTTGCCCTTCGCCTGTGTGCCTGGAGCCAGTCCCA
CCACGCTCGCGTTTCCTCCTGTAGTGCTCACAGGTCCCAGCACCGATGGCA
TTCCCTTTGCCCTGAGTCTGCAGCGGGTCCCTTTTGTGCTTCCTTCCCCTCA
GGTAGCCTCTCTCCCCCTGGGCCACTCCCGGGGGTGAGGGGGTTACCCCTT
CCCAGTG 1 TTTI T ATTCCTGTGGGGCTCACCCCAAAGTATTAAAAGTAGCT
GTAA (SEQ ID NO:503). Also preferred are polypeptide fragments encoded by these
polynucleotide fragments.
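
To illustrate what "polypeptide fragments encoded by these polynucleotide fragments" means computationally, the following Python sketch translates a nucleotide string with the standard genetic code. It is a minimal sketch under stated assumptions: the reading frame of the fragment is not given in the text, so the frame offsets tried here are guesses, and the function name `translate` is introduced only for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, frame=0):
    """Translate a DNA string in the given frame (0, 1 or 2).

    Codons containing characters other than A, C, G, T are reported as 'X';
    a trailing partial codon is ignored. '*' marks a stop codon.
    """
    dna = dna.upper().replace(" ", "")
    protein = []
    for i in range(frame, len(dna) - 2, 3):
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


# First line of the SEQ ID NO:503 fragment as printed above; which frame (if
# any) encodes the preferred polypeptide is an assumption, so all three
# forward frames are shown.
fragment = "GCTTCGTGTCCAACCCTCTTGCCCTTCGCCTGTGTGCCTGGAGCCAGTCCCA"
for frame in range(3):
    print(frame, translate(fragment, frame))
```

A complete treatment would also translate the reverse complement, giving six frames in total.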

This gene is expressed primarily in lung, esophagus, leukemia (Jurkat cells) and
breast cancers and to a lesser extent in macrophages treated with GM-CSF, fetal tissues
and a wide range of tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancers of a wide range of origins. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the solid tumors, lung and leukemia,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Furthermore, due to the high expression level in lung tissue and the proposed
function of the multidrug resistance protein 1 gene as the efflux pump responsible for
low drug accumulation in multidrug-resistant cells, the protein, as well as mutants thereof,
may also be beneficial as a target for gene therapy, particularly for the chronic patient.
Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 287 as
residues: Met-1 to Lys-16.

The tissue distribution in a wide range of cancers and fetal tissues indicates that
polynucleotides and polypeptides corresponding to this gene are useful for detection of
cells in active proliferation, such as cancers. The gene products may be used as cancer
markers or immunotherapy targets.

FEATURES OF PROTEIN ENCODED BY GENE NO: 55 
This gene maps to the X chromosome.

This gene is expressed primarily in the brain and to a lesser extent in the
developing embryo.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurodegenerative disease states and developmental disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders, including sex-linked disorders, of the above tissues or cells,
particularly of the neurological, developmental systems, and cardiovascular system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Moreover, this gene maps to the X chromosome, and therefore may be used
as a marker in linkage analysis for this chromosome.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the detection/treatment of neurodegenerative
disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's
Disease, Huntington's Disease, Klinefelter's, schizophrenia, mania, dementia,
paranoia, obsessive compulsive disorder and panic disorder. In addition, the gene or
gene product may also play a role in the treatment and/or detection of developmental
disorders associated with the developing embryo, sexually-linked disorders, or
disorders of the cardiovascular system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 56 
The translation product of this gene shares sequence homology with paxillin,
which is thought to be important in mediating signal transduction from growth factor
receptors to the cytoskeleton. Preferred polynucleotide fragments comprise the
following sequence: TGGCTCACTGTCTTACAATCACTGCTGTGGAATCATGA
TACCACTTTTAGCTCTTTGCATCTTCCTTCAGTGTAT1 1 1 " 1 'GTTTTTCAAGAGG
AAGTAGATTTTAACTGGACAACTTTGAGTACTG
GGGTTGTGGTTTCAA (SEQ ID NO:506). Also preferred are polypeptide fragments
encoded by these polynucleotide fragments. More preferably, polypeptide fragments
comprise the amino acid sequence: LDELMAHLTEMQAKVAVRAD
AGKKHLPDKQDHIC^SLDSN'ILGGLEQELQDLGIATVPKGHCASCQKPUGKVI
HALGQSWHPEHFVCTHCKEEIGSSPFFERSGLXYCPNDYHQLFSPRCAYCAAP
ILDKVLTAMNQTWHPEHFFCSHCGEVFG^GFHEKDKKPYCRKDFLAMFSPK
CGGCNRPVLENYLSAMDTVWHPECFVC
HRRGTLCHGGGQPITGRCISAMGYKFHPEHFVCAFCLTQLSKGIFREQNDKTY
CQPCFNKLF (SEQ ID NO:507); KLASLDSMLGGLEQELQDLGIATVPKGHC
ASCQKPIAGKVIHAL (SEQ ID NO:508); CPNDYHQLFSPRCAYCAAPILDKVL
TAMNQTWHPEHFFCSHCGEVFGAEG (SEQ ID NO:509); DKKPYCRKDFLAM
FSPKCGGCNRPVLENYLSAMDTVWHPECFVCGDCFTSFSTGSFFELDGRPFCE
L (SEQ ID NO:510); CGQPITGRCISAMGYKFHPEHFVCAFCLTQLSKGIFRE
QNDKTYCQ (SEQ ID NO:511). Polynucleotide fragments encoding these preferred
polypeptide fragments are also contemplated.

This gene is expressed primarily in brain, and to a lesser extent in the 
developing embryo. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurological disease states and developmental abnormalities. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune and
nervous systems, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Moreover, since this gene shares homology with a
gene that maps to chromosome 11 (See Accession No. T37404), the gene, as well as its
translated product, may be used for linkage analysis on chromosome 11.

The tissue distribution and homology to paxillin indicate that polynucleotides
and polypeptides corresponding to this gene are useful for the treatment and/or detection
of disease states associated with abnormal signal transduction in brain and/or the
developing embryo. This would include treatment or detection of neurodegenerative
disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's
Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia, obsessive
compulsive disorder and panic disorder, and also the treatment and/or detection of
embryonic development defects.

FEATURES OF PROTEIN ENCODED BY GENE NO: 57

This gene is expressed primarily in fetal spleen, brain, and to a lesser extent in 
six week old embryo. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune disorders, neurological disorders, and developmental
abnormalities. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune and developmental systems, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder. Preferred epitopes include
those comprising a sequence shown in SEQ ID NO: 290 as residues: Arg-28 to Gly-34.
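
Epitope ranges of this form (for example, Arg-28 to Gly-34) are given as 1-based, inclusive residue positions in the referenced sequence. The Python sketch below shows one way such a range could be pulled out of a one-letter sequence and sanity-checked. It is illustrative only: SEQ ID NO: 290 is not reproduced in this section, so `toy_sequence` is a made-up placeholder, and the helper names are assumptions introduced for the example.

```python
# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def parse_residue(label):
    """Split a label such as 'Arg-28' into ('R', 28)."""
    name, position = label.split("-")
    return THREE_TO_ONE[name], int(position)


def extract_epitope(sequence, start_label, end_label):
    """Return the inclusive, 1-based residue range named by the two labels.

    Raises ValueError if the range is out of bounds or if the residues at
    the end points do not match the amino acids named in the labels.
    """
    start_aa, start = parse_residue(start_label)
    end_aa, end = parse_residue(end_label)
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("residue range is outside the sequence")
    if sequence[start - 1] != start_aa or sequence[end - 1] != end_aa:
        raise ValueError("end-point residues do not match the labels")
    return sequence[start - 1:end]


# Hypothetical placeholder sequence with Arg at position 28 and Gly at 34.
toy_sequence = "M" + "A" * 26 + "R" + "KLSTP" + "G" + "A" * 6
print(extract_epitope(toy_sequence, "Arg-28", "Gly-34"))
```

The end-point check is a cheap guard against off-by-one errors or a mismatched sequence listing.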

The expression of this gene in fetal spleen indicates that polynucleotides and
polypeptides corresponding to this gene are useful for treatment/detection of immune
disorders such as arthritis, asthma, immune deficiency diseases such as AIDS, and
leukemia. In addition, the expression of this gene in the early embryo indicates a key
role in embryo development, and hence the gene or gene product could be used in the
treatment and/or detection of embryonic development defects. This would include
treatment or detection of neurodegenerative disease states and behavioral disorders such
as Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, schizophrenia,
mania, dementia, paranoia, obsessive compulsive disorder and panic disorder, and also
the treatment and/or detection of embryonic development defects.

FEATURES OF PROTEIN ENCODED BY GENE NO: 58 
The translation product of this gene shares sequence homology with the gene disrupted
in the neurodegenerative disease dentatorubral-pallidoluysian atrophy. Moreover, a long
open reading frame exists in an alternative frame. Preferred polypeptide fragments
comprise the following:
mgssqsveipgggtegyhvlrvqenspghraglepffdfivsingsrlnkdnd
tlkdllkxn^ekpvkiviliyssktlelretsvtpsnlwggqgllgvsirfcsfd
ganenvvvtivlevesnspaalaglrphsdyiigadtvmnesedlfsliettieakp
lklyvwtdtdncreviitpnsawggegslgcgigygylhriptrpfeegkkis
lpgqmagtpitplkdgftevqlssvnppslsppgttgieqsltglsisstppavss
vlstgvptvpllppqvnqsltsvppmnpattlpgliviplpaglpnlpnlnlnlpa
phimpgvglpelvnpglpplpsmpprnlpgiaplplpseflpsfplvpesssaass
gellsslpptsnapsdpatttakadaassltvdvtpptak-apttvedrvgdstpv
sekpvsaavdanasesp (SEQ ID NO:512); sveipgggtegyhvlrvqenspgh
raglepffdftvslngsrlnkdndtlkdllkxnvekpvk.mliyssktlelrets
ytrpsnlwggqgllgvsirfcsfdganenvwh (SEQ ID NO:513); esnspaa
laglrphsdyngadtv^esedlfsletheakplklyvyntdtdncrevittp
nsawggegslgcgigygylhriptrpfeegkkislpgqmagtpitplkdgftev
qlssvnppslsppgttgieqsltglsiss (SEQ ID NO:514); riptrpfeegkki
slpgqmagtpitplkdgftevqlssvnppslsppgttgieqsltglsisstppavs
svlstgvptvpllppqvnqsltsvppninpattlpglmplpaglpnlpnlnlnlp
aphiimpgvglpelvnpglpplpsmpprn (SEQ ID NO:516); pglpplpsmpprn
lpgiarlplpseflpsfplvpesssaassgellsslpptsnapsdpatttakadaa
ssltvdvtpptajcapttvedrvgdstpvsekpvsa.a.vd.an (SEQ ID NO:517).

This gene is expressed primarily in prostate cancer, and to a lesser extent in the
pineal glands and in fetal lung.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurological conditions and pulmonary disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the nervous,
pulmonary, and endocrine systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 291 as residues: Asn-9 to Leu-14.
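
A residue range of the form "Asn-9 to Leu-14" names the first and last residues of an epitope by three-letter code and 1-based position. As a minimal sketch only, assuming the protein is held as a one-letter-code string, such a range could be turned into a subsequence as shown below. The function name `epitope_slice` and the example sequence are hypothetical and are used purely for illustration; the example sequence is not SEQ ID NO: 291.

```python
import re

# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Matches ranges such as "Asn-9 to Leu-14" (optional space after the hyphen).
RANGE_RE = re.compile(
    r"^\s*([A-Za-z]{3})-\s*(\d+)\s+to\s+([A-Za-z]{3})-\s*(\d+)\s*$"
)


def epitope_slice(sequence, residue_range):
    """Return the subsequence named by a range like 'Asn-9 to Leu-14'.

    Positions are 1-based and inclusive. A ValueError is raised if the
    range is malformed, out of bounds, or if the residues found at the
    two end positions do not match the three-letter codes in the range.
    """
    match = RANGE_RE.match(residue_range)
    if match is None:
        raise ValueError(f"unrecognized residue range: {residue_range!r}")

    first_code = match.group(1).capitalize()
    start = int(match.group(2))
    last_code = match.group(3).capitalize()
    end = int(match.group(4))

    if not 1 <= start <= end <= len(sequence):
        raise ValueError("residue positions fall outside the sequence")

    seq = sequence.upper()
    for code, pos in ((first_code, start), (last_code, end)):
        expected = THREE_TO_ONE.get(code)
        if expected is None:
            raise ValueError(f"unknown amino acid code: {code}")
        if seq[pos - 1] != expected:
            raise ValueError(
                f"position {pos} is {seq[pos - 1]}, not {code} ({expected})"
            )

    # Convert the 1-based inclusive range to a Python slice.
    return seq[start - 1:end]


# Hypothetical 20-residue sequence, chosen so that residue 9 is Asn
# and residue 14 is Leu. It is NOT taken from the sequence listing.
HYPOTHETICAL_SEQUENCE = "MKTAYIAKNRQISLVKSHFS"

if __name__ == "__main__":
    print(epitope_slice(HYPOTHETICAL_SEQUENCE, "Asn-9 to Leu-14"))
```

Checking the end residues against the stated three-letter codes is a design choice of this sketch: it catches a range applied to the wrong sequence or an off-by-one position before the subsequence is used.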

The abundance of this gene in the pineal gland and its homology to a gene
disrupted in the neurodegenerative disease state dentatorubral-pallidoluysian atrophy
indicate that this gene may be useful in the treatment and/or detection of other
neurodegenerative disease states and behavioral disorders such as Alzheimer's Disease,
Parkinson's Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia,
obsessive compulsive disorder and panic disorder. The abundance of this gene in fetal
lung would suggest that misregulation of the expression of this protein product in the
adult could lead to lymphoma or sarcoma formation, particularly in the lung; that it may
also be involved in predisposition to certain pulmonary defects such as pulmonary
edema and embolism, bronchitis and cystic fibrosis; and thus the gene or the
protein encoded by the gene could be used in the detection and/or treatment of these
pulmonary disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 59 
This gene is expressed primarily in the developing embryo. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, developmental abnormalities. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the developmental system, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder.

The expression of this gene primarily in the embryo indicates that the gene plays a
key role in embryo development and that the gene or the protein encoded by the gene
could be used in the treatment and/or detection of developmental defects in the embryo
or in infants.

FEATURES OF PROTEIN ENCODED BY GENE NO: 60 

This gene displays homology to nestin, an intermediate filament protein, the
expression of which correlates with the proliferation of Central Nervous System
progenitor cells and that is useful in the identification of brain tumors. This gene maps
to chromosome 1, and therefore, may be used as a marker in linkage analysis for
chromosome 1 (See Accession No. AA527343).

This gene is expressed primarily in kidney and to a lesser extent in brain. 
Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, renal disorders and neurodegenerative conditions. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the excretory and
nervous systems, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 293 as residues: Thr-128 to Asn-135.

The tissue distribution and homology to nestin indicate that polynucleotides
and polypeptides corresponding to this gene are useful for detection and/or treatment of
neurodegenerative disease states and behavioral disorders such as Alzheimer's Disease,
Parkinson's Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia,
obsessive compulsive disorder and panic disorder. In addition, its abundance in kidney
indicates that it is useful in the treatment and detection of acute renal failure and other
disease states associated with the kidney.

FEATURES OF PROTEIN ENCODED BY GENE NO: 61

Gene shares homology with the latrophilin-related protein 1 precursor as well as
the calcium-independent alpha-latrotoxin receptor. Preferred polypeptide fragments
comprise the following amino acid sequence:

IYKVFRHTAGLKPEVSCFENIRSCARXXXXXXXXXXXXWIFGVLHVVHASVV
TA^XFTVSNAFQGMFIFLFLCVLSRKIQEEYYRLFKNVPCC (SEQ ID NO:513);
WIFGVLHVVHASVVTA - C (SEQ ID NO:519). Also preferred are polynucleotide fragments
encoding these polypeptide fragments. (See Accession No. 2213659) The translation
product of this gene shares sequence homology with CD 97, a seven transmembrane
bound receptor.
This gene is expressed primarily in infant brain and in endothelial cells.
Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurological disorders and hematopoietic disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the neurological and
hematopoietic systems, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 294 as residues: Lys-13 to Leu-21.

The tissue distribution of this gene suggests that it may be useful in the detection
and/or treatment of neurodegenerative disease states and behavioral disorders such as
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, schizophrenia, mania,
dementia, paranoia, obsessive compulsive disorder and panic disorder, while its
expression in hematopoietic cell types indicates that the gene could be important for the
treatment or detection of immune or hematopoietic disorders including arthritis, asthma
and immunodeficiency diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 62 
This gene is expressed primarily in fetal liver and fetal spleen. 
Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, hematological and immunological disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune and hematopoietic systems,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 295 as residues: Ser-91 to Lys-98.

The tissue distribution of this gene in fetal liver and spleen indicates that the gene
could be important for the treatment or detection of immune or hematopoietic disorders 
including arthritis, leukemia, asthma and immunodeficiency diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 63 

Gene shares homology with human serum amyloid protein. Preferred polypeptide
fragments comprise the following amino acid sequence:

.AI^TRIPPGDWVlNYTAVSFAGKpTARFFHSSPPSLGDQARTDPGHQRR^ (SEQ 
ID NO:520) (See Accession No. W13671). Also preferred are polynucleotide 
fragments encoding these polypeptide fragments. This gene maps to chromosome 9, and
therefore, may be used as a marker in linkage analysis for chromosome 9 (See 
Accession No. AA004342). 

This gene is expressed primarily in fetal liver and spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, hematopoietic and immune disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the hematopoietic and immune systems, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder.

The tissue distribution of this gene in fetal liver-spleen indicates that the gene
could be important for the treatment or detection of immune or hematopoietic disorders 
including arthritis, leukemia, asthma, and immunodeficiency diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 64

This gene maps to chromosome 3, and therefore, may be used as a marker in
linkage analysis for chromosome 3 (See Accession No. AA219669).
This gene is expressed specifically in the brain.
Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurodegenerative disease states. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the neurological systems, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the detection/treatment of neurodegenerative
disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's
Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia, obsessive
compulsive disorder and panic disorder.

FEATURES OF PROTEIN ENCODED BY GENE NO: 65 

Gene shares homology with a yeast protein. Preferred polypeptide fragments
comprise the following amino acid sequence: LQEVNITLPENSVWYERYKFDIP
VFHL (SEQ ID NO:521). Also preferred are polynucleotide fragments encoding these
polypeptide fragments. (See Accession No. 1332638)

This gene is expressed primarily in fetal tissue (fetus and fetal liver). 
Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, liver disorders and cancers (e.g. hepatoblastoma). Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the hepatic system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 298 as residues: Asn-59 to Glu-64.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the detection and treatment of liver disorders
and cancers (e.g. hepatoblastoma, jaundice, hepatitis, liver metabolic diseases and
conditions that are attributable to the differentiation of hepatocyte progenitor cells). In
addition, the expression in fetus would suggest a useful role for the protein product in
developmental abnormalities, fetal deficiencies, pre-natal disorders and various
wound-healing models and/or tissue trauma.

FEATURES OF PROTEIN ENCODED BY GENE NO: 66 

Gene has homology with a B-cell surface antigen, which may indicate that the gene plays
a role in the immune response, including, but not limited to, disorders and infections of
the immune system. Preferred polynucleotide fragments comprise the following
sequence: TAGCATGTAGCCAGTCGAATAACNTATAAGGACAAAGTGGAGTC
CACGCGTGCGGCCGTCTAGACTAGTGGATCCCCCGGCTGCAGGATTCGGC
ACGAG (SEQ ID NO:523). Also preferred are polypeptide fragments encoded by
these polynucleotide fragments (See Accession No. T94535). Additionally, this gene
shares homology with an interferon-gamma receptor. Preferred polypeptide fragments
also comprise the following amino acid sequence: MQGSGSQFRACLLCLCFSCPC
SPGGPRWNSRQGGRRFPKTCRAISQNLVFKYKTFCPVRYMQPHRSSLCLHFTS
YVFILSTWGSLRTYSTDLKKKKKNSRGGPVPIRPKS (SEQ ID NO:522);
MQGSGSQFRACLLCLCFSCPCSPGGPRWNSRQGGRRFPKTCRAISQNLVFK
(SEQ ID NO:524); PVRYMQPHRSSLCLHFTSYVFILSTWGSLRTYSTDLKKKKK
NSRGGPVPIRPKS (SEQ ID NO:525); and GEEQRDCSLGWRGVGMRATHCQAA
RMFVLFSLPKYAGL (SEQ ID NO:526). Also preferred are polynucleotide fragments
encoding these polypeptide fragments.

This gene is expressed primarily in T-cells and gall bladder.
Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immunological disorders and conditions (immunodeficiencies, cancer,
leukemia, hematopoiesis). Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune and digestive systems, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 299 as residues:
Thr-41 to Gly-52.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the treatment and diagnosis of immune
disorders including: leukemias, lymphomas, auto-immune disorders, immunosuppressive
conditions (transplantation) and immunodeficiencies (e.g. AIDS), inflammation and
hematopoietic disorders. The expression of this gene in gall bladder would suggest a
possible role for this gene product in digestive disorders, particularly of the pancreas.

FEATURES OF PROTEIN ENCODED BY GENE NO: 67 

This gene maps to chromosome 11, and therefore, may be used as a marker in
linkage analysis for chromosome 11 (See Accession No. AA011622).

This gene is expressed primarily in a variety of fetal and developmental tissues
(e.g. fetal spleen, infant brain).

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, developmental, immune or neurological abnormalities. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the developing
immune and central nervous systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 300 as residues: Ser-38 to Ser-43.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for developmental abnormalities or fetal 
deficiencies. The detection in infant brain would suggest a role in neurological disorders 
(both developmental and neurodegenerative conditions of the brain and nervous system, 
behavioral disorders, depression, schizophrenia, Alzheimer's disease, Parkinson's 
disease, Huntington's disease, mania, dementia). In addition, the detection in spleen
would similarly suggest a role in detection and treatment of immunologically mediated
disorders (e.g. immunodeficiency, inflammation, cancer, wound healing, tissue repair,
hematopoiesis).

FEATURES OF PROTEIN ENCODED BY GENE NO: 68 

This gene is expressed primarily in spleen, T-cells, and fetal heart.
Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immunological deficiencies, including AIDS and cardiovascular
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune and cardiovascular systems, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and treatment of immune
disorders including: leukemias, lymphomas, autoimmune disorders,
immunodeficiencies (e.g. AIDS), immunosuppressive conditions (transplantation) and
hematopoietic disorders. The expression in fetal heart indicates that polynucleotides and
polypeptides corresponding to this gene are useful for the treatment and diagnosis of
cardiovascular disorders (e.g. heart disease, restenosis, atherosclerosis, stroke, angina,
thrombosis).

FEATURES OF PROTEIN ENCODED BY GENE NO: 69

Gene shares homology with a human collagen protein. Preferred polypeptide
fragments comprise the following amino acid sequence:
MPRKTSKCRQLLCSGASRNADTAARQSTCSSHRPPGKIPSLGPRRXPGCXSVP
SSRGEQSTGSPAAPRCGRRDAHRGLPGGAAMTPGDTWASFNPRAGHSKSQGE
GQESSGASRQDRHPVSHWVTRQREAWGAPRSSSAGGVKVAATTEREPEFKIK
TGKA (SEQ ID NO:527); CSGASRNADTAARQSTCSSHRPPGKIPSLGPRRXPG
CXSVPSSRGEQSTGSPAAPRCGRRDAHRGLPGGAAMTPGDTWASFNPRAGHS
(SEQ ID NO:528); QGEGQESSGASRQDRHPVSHWVERQREAWGAPRSSSAGG
VKVAATTEREPEFKIKTGKA (SEQ ID NO:529) (See Accession No. 124886). Also
preferred are polynucleotide fragments encoding these polypeptide fragments.
This gene is expressed primarily in fetal heart.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cardiovascular disorders. Similarly, polypeptides and antibodies directed
to these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the cardiovascular system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 302 as residues:
Pro-32 to Ser-39.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the treatment and diagnosis of cardiovascular
disorders (e.g. heart disease, restenosis, atherosclerosis, stroke, angina, thrombosis).

FEATURES OF PROTEIN ENCODED BY GENE NO: 70 

The translation product of this gene shares sequence homology with a chicken
single-strand DNA-binding protein. Preferred polypeptide fragments comprise the
following amino acid sequence:

MSPRYPGGPRPPLRIPNQALGGVPGSQPLLPSGMDPTRQQGHPNMGGPMQRM
TPPRGMVPLGPQNYGGAMRPPLNALGGPGMPGMNMGPGGGRPWPNPTNAN
SIPYSSASPGNYVGPPGGGGPPGTPLViPSPADSTNSGDNMYTLMNAVPPGPNR
PNFPMGPGSDGPMGGLGGMESHHMNGSLGSGDMDSISKNSPNNMSLSNQP
GTPRDDGEMGGNFLNPFQSESYSPSMTMSV (SEQ ID NO:530); MSPRYPGG
PRPPLRIPNQALGGVPGSQPLLPSGMDPTRQQGHPNMGGPMQRMTPPRGMVP
LGPQNYGGAMRPPLNALGGPGMPGMNMGPGGGRPWPNPTNANSIPYSSASP
GNY (SEQ ID NO:531); LNALGGPGMPGMNMGPGGGRPWPNPTNANSIPYSS
ASPGNYVGPPGGGGPPGTPIMPSPADSTNSGDNMYTLMNAVPPGPN (SEQ ID
NO:532); GPMGGLGGMESHHMNGSLGSGDMDSISKNSPNNMSLSNQPGTPR
DDGEMGGNTLNPFQSESYSPSMTMSV (SEQ ID NO:533); TCEHSSEAKAFHDY
(SEQ ID NO:534). Also preferred are polynucleotide fragments encoding these
polypeptide fragments. (See Accession No. 1562534)
This gene is expressed primarily in placenta and to a lesser extent in the fetal
heart and a variety of other tissues and cell types.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, developmental abnormalities and fetal deficiencies, particularly of the
cardiovascular system. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the reproductive system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the detection and treatment of developmental
abnormalities or fetal deficiencies, ovarian and other endometrial cancers, reproductive
dysfunction, cardiovascular disorders, and pre-natal disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 71 

This gene is expressed primarily in fetal liver and to a lesser extent in the breast
and testes.
Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, liver disorders (including hepatoblastomas) and reproductive disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
hepatic and reproductive systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for detection and treatment of liver disorders and
cancers (e.g. hepatoblastoma, jaundice, hepatitis, liver metabolic diseases and
conditions that are attributable to the differentiation of hepatocyte progenitor cells). The
expression in testes and breast indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the detection and treatment of endocrine and
reproductive disorders (e.g. sperm maturation, milk production, testicular and breast
cancers).

FEATURES OF PROTEIN ENCODED BY GENE NO: 72

This gene maps to chromosome 1, and therefore, may be used as a marker in
linkage analysis for chromosome 1 (See Accession No. W93595).

This gene is expressed primarily in smooth muscle and to a lesser extent in
brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cardiovascular and neurological disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the cardiovascular and central nervous
systems, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the detection and treatment of restenosis,
atherosclerosis, stroke, angina, thrombosis, wound healing and other conditions of
heart disease. In addition, the expression in brain would suggest that polynucleotides
and polypeptides corresponding to this gene are useful for the detection and treatment of
developmental, degenerative and behavioral conditions of the brain and nervous system
(e.g. schizophrenia, depression, Alzheimer's disease, Parkinson's disease,
Huntington's disease, mania, dementia, paranoia, addictive behavior and sleep
disorders).

FEATURES OF PROTEIN ENCODED BY GENE NO: 73 

Gene shares homology with human stromalin-2. Preferred polypeptide
fragments comprise the following amino acid sequence:

QAFVLLSDLLLIFSPQMrVGGRDFLRPLVFFPEATLQSELASFLMDHVFIQPGDL
GSGA (SEQ ID NO:535); ACSYLLCNTEFTFFSRADFARSQLVDLLTDRFQQE
LEELLQVG (SEQ ID NO:536); QKQLSSLRDRMVAFCELCQSCLSDVDTEIQ£QV
ST (SEQ ID NO:537); QVILPALTLVYFSILWTLTHISKSDAS (SEQ ID NO:538);
STHDLTR WELYEPCCQLLQKAVDTGXVPHQV (SEQ ID NO:539). Also preferred
are polynucleotide fragments encoding these polypeptide fragments (See Accession
No. R-65208). This gene maps to chromosome 7, and therefore, may be used as a
marker in linkage analysis for chromosome 7 (See Accession No. D52585).

This gene is expressed primarily in the brain (infant brain, adult brain, pituitary,
cerebellum, hippocampus, schizophrenic hypothalamus, amygdala).

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, developmental and neurodegenerative diseases of the brain and nervous
system. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
central nervous system, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 306 as residues: Thr-25 to Lys-36,
Lys-55 to Ser-63.
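
As an illustration of how residue ranges written in the form "Thr-25 to Lys-36" can be read against a polypeptide sequence, the following minimal Python sketch parses such ranges into 1-based inclusive coordinates and extracts the corresponding fragment. The function names (parse_ranges, extract) and the demonstration sequence are hypothetical and are not taken from the sequence listing; SEQ ID NO: 306 itself is not reproduced here.

```python
import re

# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Matches text such as "Thr-25 to Lys-36".
RANGE_RE = re.compile(r"([A-Z][a-z]{2})-(\d+)\s+to\s+([A-Z][a-z]{2})-(\d+)")


def parse_ranges(text):
    """Return (start_residue, start, end_residue, end) tuples, 1-based inclusive."""
    return [(a, int(i), b, int(j)) for a, i, b, j in RANGE_RE.findall(text)]


def extract(sequence, start_res, start, end_res, end):
    """Slice a 1-based inclusive range and check that the end residues match."""
    if start < 1 or end < start or end > len(sequence):
        raise ValueError("range falls outside the sequence")
    fragment = sequence[start - 1:end]
    if fragment[0] != THREE_TO_ONE[start_res] or fragment[-1] != THREE_TO_ONE[end_res]:
        raise ValueError("end residues do not match the stated range")
    return fragment


# Hypothetical demonstration sequence (NOT SEQ ID NO: 306).
demo_sequence = "MGTAAKSLLK"
demo_ranges = parse_ranges("Thr-3 to Lys-6")
print([extract(demo_sequence, *r) for r in demo_ranges])
```

Checking the named end residues against the sequence is a design choice made here so that an off-by-one error in numbering is caught rather than silently producing a shifted fragment.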

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for detection and treatment of developmental,
degenerative and behavioral conditions of the brain and nervous system (e.g.,
schizophrenia, depression, Alzheimer's disease, Parkinson's disease, Huntington's
disease, mania, dementia, paranoia, addictive behavior and sleep disorders).

FEATURES OF PROTEIN ENCODED BY GENE NO: 74 
This gene is expressed primarily in the hypothalamus of a human suffering from
schizophrenia.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, disorders of the CNS, particularly schizophrenia. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the CNS, such as schizophrenia,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 307 as residues: Gly-38 to Ala-44.

The tissue distribution indicates that the protein products of this gene are useful 
for the study, diagnosis and treatment of schizophrenia and other disorders involving 
the CNS. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 75

Preferred polypeptides of the invention comprise the following amino acid
sequence encoded by this gene:
LAVSTSFICCADISTALPLGSSRPAPAPRHREHEHGHQARPPRLLXTSLMPLSTP
AAAQLLWTQLTPMGGRPGGRHSPPTLHTGPFLALPPGPPHPSLHVAALSLLR
(SEQ ID NO:540). Polynucleotides encoding such polypeptides are also provided.

This gene is expressed primarily in endometrial tumor and to a lesser extent in
amniotic cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, reproductive and immune disorders, particularly cancers of those
systems. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the reproductive and immune systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 308 as residues: Ser-3 to Arg-9.

The tissue distribution indicates that the protein products of this gene are useful
for study and treatment of immune and reproductive disorders, particularly cancers of
those systems.



FEATURES OF PROTEIN ENCODED BY GENE NO: 76 

This gene is expressed primarily in kidney cortex and to a lesser extent in
early-stage human brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, renal disorders such as renal cancer. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the kidney, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 309 as residues:
Gly-38 to Gly-45, Gly-47 to Gly-52, Pro-92 to Lys-110.

The tissue distribution indicates that the protein products of this gene are useful
for study, treatment and diagnosis of renal diseases such as cancer of the kidney.

FEATURES OF PROTEIN ENCODED BY GENE NO: 77

This gene is expressed primarily in kidney medulla.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, metabolic and renal disorders. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the metabolic and renal systems, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.
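
The comparison described above, between the expression level in a test sample and the standard level in healthy tissue or bodily fluid, can be sketched as a simple fold-change calculation. The following Python fragment is only an illustration under stated assumptions: the function name compare_to_standard, the two-fold threshold, and the measurement values are hypothetical, and the specification does not define what counts as "significantly" higher or lower.

```python
import math
from statistics import mean


def compare_to_standard(sample_level, standard_levels, fold_threshold=2.0):
    """Classify a sample's expression level against a healthy-tissue standard.

    fold_threshold is an illustrative assumption; the text does not state a
    numerical cutoff for "significantly" higher or lower expression.
    """
    if sample_level <= 0 or any(v <= 0 for v in standard_levels):
        raise ValueError("expression levels must be positive")
    # Standard level: mean expression across healthy reference samples.
    standard = mean(standard_levels)
    log2_fold = math.log2(sample_level / standard)
    cutoff = math.log2(fold_threshold)
    if log2_fold >= cutoff:
        call = "higher"
    elif log2_fold <= -cutoff:
        call = "lower"
    else:
        call = "comparable"
    return call, log2_fold


# Hypothetical measurements in arbitrary units.
print(compare_to_standard(8.0, [2.0, 2.0, 2.0]))
```

Working on a log2 scale treats a two-fold increase and a two-fold decrease symmetrically; in practice a statistical test over replicate measurements would likely replace the fixed threshold assumed here.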

The tissue distribution indicates that the protein products of this gene are useful 
for study, treatment and diagnosis of metabolic and renal diseases and disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 78

This gene is expressed in chronic synovitis and microvascular endothelium.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, arthritis and atherosclerosis. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the vascular and skeletal systems, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that the protein products of this gene are useful
for study, diagnosis and treatment of arthritic and other inflammatory diseases as well
as cardiovascular diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 79

This gene is expressed in resting T-cells and activated monocytes.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that the protein products of this gene are useful 
for the study and treatment of immune diseases such as inflammatory conditions. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 80 

This gene is expressed in a variety of immune system tissues, e.g., neutrophils,
T-cells, and TNF-induced epithelial and endothelial cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, infectious and immune disorders. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the immune and vascular systems, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 313 as residues: Met-1 to Trp-6.

The tissue distribution indicates that the protein products of this gene are useful
for study and treatment of infectious diseases, immune and vascular disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 81
This gene is expressed in activated neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, inflammation and other immune conditions. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that the protein products of this gene are useful 
for study and treatment of immune disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 82 

This gene is expressed in neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, inflammatory and other immune conditions. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 315 as residues:
Ala-83 to Thr-91.

The tissue distribution indicates that the protein products of this gene are useful
for study and treatment of immune disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 83
This gene is expressed in human neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, inflammation and immune disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune and inflammatory system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that the protein products of this gene are useful 
for diagnosis and treatment of disorders of the inflammatory and immune systems. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 84
This gene is expressed in human neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, disorders of the inflammatory and immune systems. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the inflammatory and
immune systems, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that the protein products of this gene are useful
for diagnosis and treatment of disorders of the immune and inflammatory systems.

FEATURES OF PROTEIN ENCODED BY GENE NO: 85
This gene is expressed in activated neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, inflammation and immune system diseases. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system and inflammatory
system, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that the protein products of this gene are useful
for diagnosis and treatment of diseases of the inflammatory and immune systems.

FEATURES OF PROTEIN ENCODED BY GENE NO: 86
This gene is expressed in activated neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, inflammation and immune system disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the inflammatory and immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 319 as residues: Met-1 to Gly-6, Gly-32 to Pro-43, Leu-55 to Gln-60.

The tissue distribution indicates that the protein products of this gene are useful 
for diagnosis and treatment of disorders of the immune and inflammatory system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 87

In specific embodiments, polypeptides of the invention comprise the sequence:
EQVLALLWPRFELILE^INVQSVRSTDPQRLGGLDTRPH^ITRRY.AEFSSALVSIN
QTffMERTMQLLGQLQVEVBNFVLRVAAEFSSRKEQLVF^
RAADDS KEVESFQQLLN ARTQEFTEELLS PPFGGL V Af^KE A£ ALJERGQ AERLR
GEEARVTQLIRGFGSSWKSSVESLSQDVMRSFTNFRNGTSIIQG (SEQ ID
NO:541); ALLKYRFFYQFIiGNERATAKEIRDE YA/ETLS KIYLS Y YRS YLGRLMK
VQYEEV,\EKDDLMGVEDTAKKGFXSKPSRSRNTIFTLGTRGSVISPTELEAPILV
PHTAQR (SEQ ID NO:542); EQRYPFEALFRSQHY"XLLDNSCREYLF1CEFFVVS
GP.^HDLFHAV^IGRTLSMTLKHLDSYLACCYDAIAVFLCIHIVLRFRNIAAKRD
VPALDRYW (SEQ ID NO:543); GGLDTRPHYITRRY AEFSS ALVSINQ (SEQ ID
NO:544); SRKEQLVFLINNYDMMLGVL (SEQ ID NO:545); and/or ALLKYRFFY
QFLLGNERATAKEIRDEYVETLSKIYLSYYRSYLGRLMKVQYEEV.AEKDDLMG
VEDTAKKGFXSKPSLRSRNTIFTLGTRGSVISPTELEAPILVPHTAQRXEQRYPF
EALFRSQHYXLLDNSCREYLFICEFFVVSGPXAHDLFHAVMGRTLSMTLKHLD
SYLADCYDAJAVFECmTvl.RFRNLAAKRDVP.ALDRYWEQVLALL\W
NVQSVRSTDPQRLGGLDTRPHYITRRYAEFSSALVSINQTIPNERTMQLLGQLQV
EVENF^TRVA.AEFSSRKEQLVFLIN^YDNn\ILGVLMERAADDSK^
ARTQEFmELLSPPFGGLVAFVKEAEALIERGQAERLRGEEARVTQLIRGFGSSW
KSS VES LS QD VMRSFTNFRN GTS (SEQ ID NO:546). Polynucleotides encoding
these polypeptides are also encompassed by the invention. The translation product of
this gene shares sequence homology with suppressor of actin mutation, which is thought
to be important in mutation suppression.
This gene is expressed primarily in fetal liver and to a lesser extent in a variety
of other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, liver disorders and mutations. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the liver or cancer, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 320 as residues:
Val-53 to Arg-60, Thr-88 to Thr-94, Ala-142 to Ser-150, Gly-188 to Glu-196,
Gly-208 to Ser-214, Thr-227 to Gly-232, Lys-279 to Phe-285.

The tissue distribution and homology to suppressor of actin mutation suggest
that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis and treatment of liver disorders or cancer.

FEATURES OF PROTEIN ENCODED BY GENE NO: 88 

This gene maps to chromosome 9, and therefore can be used in linkage analysis 
as a marker for chromosome 9. In specific embodiments, polypeptides of the invention 
comprise the sequence: 

YEGKEFDYVFSIDV^ffiGGPSYKLPYNTSDDPWLTAYNFLQKNDLNP^IFLDQVA 
KFIIDNTKGQMLGLGNPSFSDPFTGGGRYVPGSSGSSNTLPTADPFTGAGRYV 
PGSASMGTTMAGVDPFTGNSAYRSAASKTNINIYFPKKEAVTFDQANPTQELGK 
LKELNGTAPEEKKLTEDDLnj-ElOLSLICNSSSEKPTVQQLQrLWKAINCPEDIV 

FPALDILRLSIKHPSVNENFCNEKEGAQFSSHLINLLNPKGKPANQLLALRTFC 
NCFVGQAGQKLMMSQRESLMSHAIELKSGSNKNI (SEQ ID NO: 547); 
HI AL ATL AJLNYS VCFHKD (SEQ ID NO: 548); HNIEGKAQCLSLISTILEVVQ 
DLEATFRLLVALGTLISDDSNAVQLAKS (SEQ ID NO:549); LGVDSQIKKYSS
VSEPAKVSECCRFILNLL (SEQ ID NO:550); and/or YEG KEFD Y VFS ID VNEGGPS 
YKLPYNTSDDPWLTAYNFLQKNDLNPNIFLDQVAKFnDNTKGQMLGLGNPSFS 
DPFTGGGRYVPGSSGSSNTLPTADPFTGAGRYVPGSASMGTTMAGVDPFTGN 
SAYRSAASKTNINlYFPKKEAVTFDQANPTQn-GKLKELNGTAPEEKKLTEDDLI 
IXEKTLSLia^SSSEKPTVQQLQILWKAlNCPEDIW^ 

NEKEGAQFSSHLINLLNPKGKPANQLLALRTFCNCFVGQAGQKL^IMSQRESL 
MSHAIELKSGSNKNIHIALATLAENYSVCFHKDHNIEG^QCLSLISTILEVVQD 
IJEATFRLLVALGTLISDDSNAVQLAKSLGVDSQIKKYSSVSEPAKVSECCRFILN 
LL (SEQ ID NO:551). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. These polypeptides share significant homology with
phospholipase A2 activating protein which is thought to be important in signal 
transduction (see, e.g., Wang et al., Gene 161(2):237-241 (1995)). 

This gene is expressed primarily in endothelial cells and to a lesser extent in placenta,
endometrial stromal cells, osteosarcoma, testis tumor, muscle, and infant brain, tissues
that are likely to be rich in blood vessels.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, disorders of the vascular system, aberrant angiogenesis, and tumor
angiogenesis. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of the
vascular system or tumors, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution of this gene in endothelial cells and several potentially
highly vascularized tissues and its homology to phospholipase A2 activating protein
suggest that this gene may be involved in transducing signals for endothelial cells in
angiogenesis or vasculogenesis.

FEATURES OF PROTEIN ENCODED BY GENE NO: 89 

In specific embodiments, polypeptides of the invention comprise the sequence:
YPNQDGDILRDQVLHEHIQRLSKWTANHR.^LQIPEV^XREAPWPSAQSEIRTIS
AYKTPRDKVQCILRMCSTIMN^
STVQYISSFYASCLSGEESYWWMQFTAAVE (SEQ ID NO:552); YPNQDGDILR
DQVLHEHIQRLSKVVTANHR.ALQ1PEVYLREAPWPSAQSEIRTISAYKTPRDKVQ
CILRMCSTIMNXLSLANEDSVPGADDFW
SCLSGEES YWWMQFTAAVEFIKTI (SEQ ID NO:553); YPNQDGDELRDQVL (SEQ
ID NO:554); EAPWPSAQSEI (SEQ ID NO:555); PVLVFVLIKANP (SEQ ID
NO:560); SGEESYWWMQFTAAVEHKTI (SEQ ID NO:556); ADDFVPVLVF
VLIKANPP (SEQ ID NO:557); YKTPRDKVQCEL (SEQ ID NO:558); and/or
GADDFVPVLVFVLIK (SEQ ID NO:559). The translation product of this gene shares
sequence homology with human ras inhibitor and yeast VPS9p, which is thought to be
important in Golgi vacuole transport.

This gene is expressed primarily in T cells and melanocytes and to a lesser 
extent in a variety of other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, dysfunction and disorders involving T cells and melanocytes. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to ras inhibitor indicate that
polynucleotides and polypeptides corresponding to this gene are useful for regulating
signal transduction and for diagnosis and treatment of disorders involving T cells and
melanocytes.

FEATURES OF PROTEIN ENCODED BY GENE NO: 90 
This gene maps to chromosome 9, and therefore polypeptides of the invention
can be used in linkage analysis as a marker for chromosome 9. The translation product
of this gene shares sequence homology with neuronal olfactomedin-related ER localized
protein, which is thought to be important in influencing the maintenance, growth, or
differentiation of chemosensory cilia on the apical dendrites of olfactory neurons. In
specific embodiments, polypeptides of the invention comprise the sequence:
S ARASTQPP AGQHPGPC (SEQ ID NO:561); MPGRWRWQRDMHPARKLLSLL
FLILMGTELTQD (SEQ ID NO:562); S A APDS LLRS S KGSTRGS L (SEQ ID
NO:563); AAIVIWRGKSESRIAKTPGI (SEQ ID NO:564); FRGGGTLVLPPTHT
PEWLEL (SEQ ID NO:567); PLGITLPLGAPETGGGD (SEQ ID NO:565); and/or
CAAETWKGSQRAGQLC ALLA (SEQ ID NO:566).

This gene is expressed in pineal gland.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurological and endocrinological disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the neurological or endocrine systems,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 323 as residues: Leu-20 to Ala-26, Arg-32 to Arg-39, Thr-104 to Gly-112.

The tissue distribution and homology to olfactomedin-related protein indicate
that polynucleotides and polypeptides corresponding to this gene are useful for the
maintenance, growth, or differentiation of neuronal cells in the pineal gland and
therefore may be useful for diagnosis and treatment of neurological disorders of the
pineal gland.

FEATURES OF PROTEIN ENCODED BY GENE NO: 91 

This gene is expressed primarily in prostate and apoptotic T cells.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, prostate disease and T cell dysfunction. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly prostate cancer, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.
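
The comparison described above, a sample's expression level measured against a standard level from healthy tissue or fluid, can be sketched in a few lines of Python. This is an illustration only: the function name `classify_expression`, the two-fold cutoff, and the numeric inputs are assumptions introduced here, since the text does not define what counts as a "significantly" higher or lower level.

```python
def classify_expression(sample_level, standard_level, fold_threshold=2.0):
    """Compare a sample's expression level with a standard (healthy) level.

    Returns "higher", "lower", or "unchanged". The fold_threshold default
    of 2.0 is a hypothetical cutoff chosen for illustration; it is not
    taken from the specification.
    """
    if standard_level <= 0:
        raise ValueError("standard_level must be positive")
    if sample_level < 0:
        raise ValueError("sample_level must not be negative")
    if fold_threshold <= 1:
        raise ValueError("fold_threshold must be greater than 1")

    # Ratio of the sample level to the standard level.
    ratio = sample_level / standard_level
    if ratio >= fold_threshold:
        return "higher"
    if ratio <= 1.0 / fold_threshold:
        return "lower"
    return "unchanged"


if __name__ == "__main__":
    # Hypothetical measurements in arbitrary units.
    print(classify_expression(8.0, 2.0))
    print(classify_expression(1.0, 4.0))
    print(classify_expression(3.0, 2.0))
```

In practice the cutoff would be set by the assay and its statistics; the fixed ratio here only makes the "relative to the standard" comparison concrete.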

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for detecting abnormal activity in prostate and
T cells, or possibly for treating such abnormality.

FEATURES OF PROTEIN ENCODED BY GENE NO: 92 

This gene is expressed primarily in prostate and to a lesser extent in smooth 
muscle cells, fibroblasts, and placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, disorders of the prostate or vascular system. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the prostate or vascular system, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for regulating function of prostate or highly 
vascularized tissues, e.g. placenta. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 93

This gene is expressed primarily in human embryos and fetal tissues and
to a lesser extent in a wide variety of other proliferative tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, disorders in embryonic development and cell proliferation. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the embryonic tissues
and proliferative cells, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis or treatment of abnormalities in 
developing and proliferative cells and organs. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 94

The translation product of this gene shares sequence homology with 
transformation related protein which is thought to be important in transformation. 

This gene is expressed primarily in female reproductive tissues, i.e., breast 
cancer cells, placenta, and ovary and to a lesser extent in fetal lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancer or dysfunction of reproductive tissues. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the reproductive system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 327 as residues: Ser-50 to Pro-61.

The tissue distribution and homology to transformation related protein indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of conditions caused by transformation, i.e. tumorigenesis in 
reproductive organs, e.g. breast, placenta, and ovary. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 95 

This gene is expressed primarily in testes, rhabdomyosarcoma, infant brain and 
to a lesser extent in some tumors and highly vascularized tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, tumorigenesis, abnormal angiogenesis, and/or neurological disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
tumor tissues or vascular tissues, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 328 as residues: Arg-46 to Trp-54,
Pro-60 to Ile-69, Asn-116 to Ala-122, Arg-147 to Lys-153, Ser-158 to Glu-170, Ile-399 to
Ser-405, Pro-486 to Met-499, Pro-502 to Asp-508.
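
The residue-range notation used for preferred epitopes (for example, "Arg-46 to Trp-54") lends itself to mechanical parsing. The Python sketch below is offered only as an illustration of how such a list might be read into positions and span lengths: the names `parse_epitope_ranges` and `span_length` are hypothetical, and the tolerance for a stray space after the hyphen is an assumption made to cope with scanned text.

```python
import re

# Standard three-letter codes for the 20 common amino acids.
THREE_LETTER = {
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val",
}

# Matches ranges such as "Arg-46 to Trp-54"; optional whitespace after the
# hyphen is tolerated (an assumption, to handle scanning damage).
RANGE_RE = re.compile(
    r"([A-Z][a-z]{2})-\s*(\d+)\s+to\s+([A-Z][a-z]{2})-\s*(\d+)"
)


def parse_epitope_ranges(text):
    """Return a list of (start_residue, start_pos, end_residue, end_pos)."""
    ranges = []
    for match in RANGE_RE.finditer(text):
        start_res, start_pos, end_res, end_pos = match.groups()
        start_pos, end_pos = int(start_pos), int(end_pos)
        if start_res not in THREE_LETTER or end_res not in THREE_LETTER:
            raise ValueError(f"unknown residue code in {match.group(0)!r}")
        if end_pos < start_pos:
            raise ValueError(f"range runs backwards: {match.group(0)!r}")
        ranges.append((start_res, start_pos, end_res, end_pos))
    return ranges


def span_length(epitope_range):
    """Number of residues covered by a range, counting both end points."""
    _, start_pos, _, end_pos = epitope_range
    return end_pos - start_pos + 1


if __name__ == "__main__":
    listing = "Arg-46 to Trp-54, Pro-60 to Ile-69, Asn-116 to Ala-122"
    for item in parse_epitope_ranges(listing):
        print(item, span_length(item))
```

A parser of this kind checks only the notation; it does not verify that the named residues actually occur at those positions in the referenced sequence.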

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for a range of disease states including treatment of
tumor or vascular disorders and the treatment of neurological disorders such as
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, schizophrenia, mania,
dementia, paranoia, obsessive compulsive disorder and panic disorder.

FEATURES OF PROTEIN ENCODED BY GENE NO: 96

This gene maps to chromosome 7 and therefore polynucleotides of the present
invention can be used in linkage analysis as a marker for chromosome 7. The
translation product of this gene is homologous to the Clostridium perfringens
enterotoxin (CPE) receptor gene product and shares sequence homology with a human
ORF specific to prostate and a glycoprotein specific to oligodendrocytes, both of which
are tissue-specific proteins. (See, e.g., Katahira et al., J Cell Biol. 136(6):1239-1247
(1997); PMID: 9087440; UI: 97242441.)

This gene is expressed primarily in pancreas tumor and ulcerative colitis and to a 
lesser extent in several tumors and normal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, pancreatic disorder, ulcerative colitis, tumors and food poisoning.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
digestive system or tumorigenic system, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder. Preferred epitopes include
those comprising a sequence shown in SEQ ID NO: 329 as residues: Gly-147 to
Met-152, Cys-177 to Lys-188.

The tissue distribution and homology to prostate- and oligodendrocyte-specific
proteins indicate that polynucleotides and polypeptides corresponding to this gene are
useful as a marker for diagnosis or treatment of disorders of the pancreas, ulcerative
colitis, and tumors. Furthermore, identity to the human receptor for Clostridium
perfringens enterotoxin indicates that the soluble portion of this receptor could be used
in the treatment of food poisoning associated with Clostridium perfringens by blocking
the activity of perfringens enterotoxin.

FEATURES OF PROTEIN ENCODED BY GENE NO: 97

The translation product of this gene shares sequence homology with ATPase,
which is thought to be important in metabolism.

This gene is expressed primarily in testes and several hematopoietic cells and to
a lesser extent in other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, leukemia and hematopoietic disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the hematopoietic system, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 330 as residues: Leu-37 to Ala-42.

The tissue distribution and homology to ATPase indicate that polynucleotides
and polypeptides corresponding to this gene are useful as a marker for diagnosis and
treatment of leukemia and other hematopoietic disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 98

In specific embodiments, polypeptides of the invention comprise the sequence: 
MRSARPSLGCLPSWAFSQALNI (SEQ ID NO:568); LLGLKGLAPAEISAVCE
KGNFN (SEQ ID NO:569); VAHGLAWSYYIGYLRLILPELQARIR (SEQ ID
NO:570); TYNQHYNNLLRGAVSQRC (SEQ ID NO:571); ILLPLDCGVPDNLSM
ADPNIRFLDKLPQQTGDRAGIKDRVYSN (SEQ ID NO:572); SIYELLENGQRAGT
CVLEYATPLQTLFAMSQYSQAGFSGEDRLEQ (SEQ ID NO:573); AKLFCRTLE
DILADAPESQNNCRLLAYQEPADDSSFSLSQEVL
PSTSTMSQEPELLISGMEKPLPLRTDFS (SEQ ID NO:574); and/or LLGLKGLA
PAEISAVCEKGNFNVAHGLAWSYYIGYLRLILPEL (SEQ ID NO:575).
Polynucleotides encoding these polypeptides are also encompassed by the invention.

This gene is expressed primarily in prostate BPH and to a lesser extent in bone 
marrow.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, benign prostatic hypertrophy or prostate cancer. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the male urinary system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 331 as residues: Ile-60 to Asn-69, Leu-106 to Asp-112, Glu-130 to Gly-136,
Phe-160 to Glu-167, Pro-184 to Cys-190, Glu-197 to Ser-202, Arg-215 to Glu-221,
Thr-237 to Pro-242.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis or treatment of benign prostatic 
hypertrophy or prostate cancer. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 99 
This gene is expressed primarily in salivary gland. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, disorders or injuries of the salivary gland. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of glandular tissues, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment of disorders of, or injuries to, the
salivary gland or other glandular tissue.

FEATURES OF PROTEIN ENCODED BY GENE NO: 100

This gene maps to chromosome 15; accordingly, polynucleotides of the
invention can be used in linkage analysis as a marker for chromosome 15. The
translation product of this gene shares sequence homology with a C. elegans gene of
unknown function. In specific embodiments, polypeptides of the invention comprise
the sequence: DPRVRLNSLTCKHIFISLTQ (SEQ ID NO:583); TMKLLKLRRNIV
KLSLYRHFTN (SEQ ID NO:576); TLILAVAASIVFITWTT^IKFRI (SEQ ID
NO:577); VTCQSDWRELWVDDAIWRLLFS^OLFVI (SEQ ID NO:578); MVLWR
PSANNQRFAFSPLSEEEEEDEQ (SEQ ID NO:580); KEPMLKESFEGMKMRS
TKQEPNGNSKVNKAQEDDL (SEQ ID NO:584); and/or KWVEENVPSSVTDVALP
ALLDSDEERMTTHFERSKME (SEQ ID NO:582). Polynucleotides encoding these
polypeptides are also encompassed by the invention.

This gene is expressed primarily in thyroid and to a lesser extent in
osteoclastoma, kidney medulla, and lung.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, thyroid dysfunction or cancer. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the endocrine system, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 333 as residues:
Lys-107 to Leu-124, Glu-150 to Thr-159, Pro-173 to Asp-179, Ser-192 to Ser-201.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of thyroid dysfunction
or cancer.

FEATURES OF PROTEIN ENCODED BY GENE NO: 101 

This gene maps to chromosome 16 and therefore polynucleotides of the invention
can be used in linkage analysis as a marker for chromosome 16. In specific
embodiments, polypeptides of the invention comprise the sequence:
IRHELTVLRDTRPACA (SEQ ID NO:585); and/or MDFXjVIALIYD (SEQ ID
NO:586). Polynucleotides encoding these polypeptides are also encompassed by the
invention.

This gene is expressed primarily in kidney cortex and to a lesser extent in adult
brain, corpus callosum, hippocampus, and frontal cortex.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurological disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the central nervous system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment or diagnosis of neurological
disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 102 

In specific embodiments, polypeptides of the invention comprise the sequence: 

MQEMMRNQDRALSNLESIPGGYNA (SEQ ID NO:587); LRRMYTDIQEPMLSA
AQEQFGGNPF (SEQ ID NO:588); ASLVSNTSSGEGSQPSRTENRDPLPNPVVAP-
QT (SEQ ID NO:589); SQSSSASSGTASTVGGTTGSTASGTSGQSTTAPNLVPGV
GASMFNTPGMQSLLQQITENPQLMQNMLSAPY (SEQ ID NO:590);
MRSiVLMQSLSQNPDLAAQMML^ (SEQ ID
NO:591); MQNPDTLSAMSNPRAMQALLQIQQGLQTLATEAPGLIPGFTPGLG
ALGSTGGSSGTNGSNATPSENTSPTAGT (SEQ ID NO:592); TEPGHQQFI
QQMLQALAGVNPQLQNPEVRFQQQLEQL^
EERLLGSQPS (SEQ ID NO:593); RNPAMMQEMMRNQDRALSNLESIPGGY
NALRRMYTDIQEPMLSAA (SEQ ID NO:594); GNPFASLVSNTSS (SEQ ID
NO:595); ENRDPLPNPWA (SEQ ID NO:595); GKILKDQDTLSQHGIHD (SEQ ID
NO:597); GLTVHLVIKTQNRP (SEQ ID NO:598); SELQSQMQRQLLSNPEMM
(SEQ ID NO:599); PEISHMLNNPDIMR (SEQ ID NO:600); and/or
RQLIMANPQMQQLIQRNP (SEQ ID NO:601). Polynucleotides encoding these
polypeptides are also encompassed by the invention.

This gene is expressed primarily in breast.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, breast cancer. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of tumor systems, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of some types of 
breast cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 103 

The translation product of this gene shares sequence homology with secreted 
serine proteases and lysozyme C precursor, which is thought to be important in
bacteriolytic function. In specific embodiments, polypeptides of the invention comprise
the sequence: NLCHVDCQDLLNPNLLAGIHCAKRIVS (SEQ ID NO:602);
LDGFEGYSLSDWLCLAFVESKFN (SEQ ID NO:603);
NENADGSFDYGLFQINSHYWCN (SEQ ID NO:604); and/or
NLCHVDCQDLLNPNLLAGIHCAKRIVS (SEQ ID NO:605). Polynucleotides
encoding these polypeptides are also encompassed by the invention. 
This gene is expressed primarily in testes. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, infection. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 336 as residues: Ile-62 to Phe-70,
Asn-78 to Asn-84.

The tissue distribution and homology to lysozyme C precursor indicate that
polynucleotides and polypeptides corresponding to this gene are useful for boosting the
monocyte-macrophage system and enhancing the activity of immunoagents.

FEATURES OF PROTEIN ENCODED BY GENE NO: 104

This gene is expressed primarily in apoptotic T-cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of some immune
disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 105 

The translation product of this gene shares sequence homology with ARI
protein of Drosophila (accession 2058299; EMBL: locus DMARIADNE, accession
X98309), which is thought to be important in axonal path-finding in the central nervous
system. In specific embodiments, polypeptides of the invention comprise the sequence:
IREVNEVIQNPAT (SEQ ID NO:606); ITRILLSHFNWDKEKLMERYF
DGNLEKLFA (SEQ ID NO:607); NTRSSAQDMPCQICYLNYPNSYF (SEQ ID
NO:608); TGLECGHKFCMQCWSEYLTTKEVIEEGMGQTISCPAHG (SEQ ID
NO:614); CDILVDDNTVMRLITDSKVK^
CHHWKVQYPDAKPV (SEQ ID NO:609); CDILVDDNTVMRLITDSK
VKLKYQHLITNSFVECNRLLKWCPAPDCHHVVKV (SEQ ID NO:610);
GCNHMVCRNQNCKAEFCWC'LGP
RSRAALQRYL (SEQ ID NO:611); FYCNRYMNHMQSLRFEHKLYAQVKQ
KMEEMQQHNMSWIEVQFL (SEQ ID NO:612);
YWAFYLKKjNTnQSHFEN^^
RYCESR (SEQ ID NO:613). Polynucleotides encoding these polypeptides are also
encompassed by the invention.

This gene is expressed primarily in adult brain, and to a lesser extent in 
endometrial tumor, melanocytes, and infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, diseases or injuries involving axonal path development. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the central nervous 
system, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder.

The tissue distribution and homology to ARI protein indicate that
polynucleotides and polypeptides corresponding to this gene are useful for treatment of
disease states or injuries involving axonal path development, including
neurodegenerative diseases and nerve injury. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 106 

The translation product of this gene shares sequence homology with cytochrome 
b561 [Sus scrofa] which is thought to be an integral membrane protein of 
neuroendocrine storage vesicles of neurotransmitters and peptide hormones. 

This gene is expressed primarily in frontal cortex and to a lesser extent in 
rhabdomyosarcoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurological disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the central nervous system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 339 as residues:
Ser-18 to Pro-24.

The tissue distribution and homology to cytochrome b561 [Sus scrofa] indicate
that polynucleotides and polypeptides corresponding to this gene are useful for
treatment and diagnosis of neurological disorders. This gene may also be important in
regulation of some types of cancers.


FEATURES OF PROTEIN ENCODED BY GENE NO: 107 

In specific embodiments, polypeptides of the invention comprise the sequence: 
MWGYLFVDAAWNFLGCLICGW (SEQ ID NO:615); MHFISSGNVSAIRSSILLL
RXSLSYLGNCLRVSAIFVYFLLFLLLS (SEQ ID NO:616); and/or MDQALRGSPSE
GFSTDPSPPQVGRQIPSFPPWRRLVLPK^SGCFLERE\VWLCVFKLRTRPGAEA
HAYNSSILGGRGKGIT (SEQ ID NO:617). Polynucleotides encoding these
polypeptides are also encompassed by the invention.

This gene is expressed primarily in pancreas tumor and to a lesser extent in 
cerebellum. 

25 Therefore, polynucleotides and polypeptides of the invention are useful- as 

reagents, for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, pancreatic tumors. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the endocrine system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 340 as residues:
Pro-22 to Phe-33.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of pancreatic tumors. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 108 

This gene maps to chromosome 17 and therefore polynucleotides of the 
invention can be used in linkage analysis as a marker for chromosome 17. In specific 
embodiments, polypeptides of the invention comprise the sequence: 
MLPALASCCHFSPPEQAARL
KKFQPMNKIERSILHDVVEVAGLTSFSFGEDDDCRYVMIFKKEFAPSDEELDSY
RRGEEWDPQKAEEKRNXKELAQRQ (SEQ ID NO:618); EEEAAQQGPVVV
SPASDYKDKYSHLIGKGAAKDA
IRAKKRLRQSGE (SEQ ID NO:619); PPRRPAQLPLTPGAGQGAGRDKAAAIRA
HPGAPPLNHLLP (SEQ ID NO:620); AVPQAGGKQ VFDLSPLELGYVRGMCVCV
(SEQ ID NO:621); and/or SVILPALASCCHFSPPEQA.A.RLKKLQEQEKQQKVEFRK

rimekevsdrqdsgqikkkfqpmnkiersilhdvvevagltsfsfgedddcryy 

mifkkefapsdeeldsyrr^ 

spasdykdkyshugkgaakdaahmlqankt^ 

IRAKKRLRQSGE (SEQ ID NO:622). Polynucleotides encoding these polypeptides
are also encompassed by the invention. The translation product of this gene shares
sequence homology with FSA-1, which may play a role as a structural protein
component of the acrosome.

This gene is expressed primarily in fetal kidney and sperm. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, male reproductive disorders, especially involving acrosomal dysfunction.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the male
reproductive system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 341 as residues: Glu-8 to Asn-35. 

The tissue distribution and homology to FSA-1 indicate that polynucleotides
and polypeptides corresponding to this gene are useful for treatment of infertility due to
acrosomal dysfunction of sperm.

FEATURES OF PROTEIN ENCODED BY GENE NO: 109 

This gene is expressed primarily in pituitary and to a lesser extent in
epididymis.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, male reproductive disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the male reproductive system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO: 342 as residues: Met-1 to Trp-6. 

Expression of the gene in both pituitary and epididymis indicates that
polynucleotides and polypeptides corresponding to this gene are useful for treatment
and diagnosis of male reproductive disorders. This may involve a secreted peptide
produced in the pituitary that targets the epididymis.

FEATURES OF PROTEIN ENCODED BY GENE NO: 110 

In specific embodiments, polypeptides of the invention comprise the sequence:
LLCPVLNSGXSWNFPHPSQPEYSFHGFHSTRLWI (SEQ ID NO:623); and/or
PSTPWFLFLLGLTCPFSTSHPRWDSIPP (SEQ ID NO:624). Polynucleotides
encoding these polypeptides are also encompassed by the invention.

This gene is expressed primarily in resting T-cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, T-cell disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of certain immune
disorders, especially those involving T-cells.

FEATURES OF PROTEIN ENCODED BY GENE NO: 111 

This gene is expressed primarily in cerebellum and whole brain and to a lesser
extent in infant brain and fetal kidney.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurological disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the central nervous system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 344 as residues:
Asp-48 to Gly-55.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of neurological 
disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 112 

The translation product of this gene shares sequence homology with a yeast
mitochondrial ribosomal protein homologous to ribosomal protein S15 of E. coli, which
is thought to be important in the early assembly of ribosomes (See Accession No.
M38016). This gene maps to chromosome 1 and therefore may be used as a marker
in linkage analysis for chromosome 1.

This gene is expressed primarily in developmental tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, development of cancers and tumors in addition to healing wounds.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune and developmental systems, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 
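
The comparison described above, between the expression level measured in a sample and the standard expression level, can be sketched in Python as follows. This is a minimal illustration only: the function name classify_expression, the use of a simple ratio, and the two-fold default threshold are assumptions introduced here and are not taken from the specification, which does not define how a significant difference is to be measured.

```python
def classify_expression(sample_level, standard_level, fold_threshold=2.0):
    """Label a sample as 'higher', 'lower', or 'comparable' relative to a standard.

    sample_level and standard_level are expression measurements in the same
    (hypothetical) units; fold_threshold is an assumed cut-off, not a value
    given in the specification.
    """
    if sample_level < 0 or standard_level <= 0:
        raise ValueError("levels must be non-negative and the standard must be positive")
    if fold_threshold <= 1:
        raise ValueError("fold_threshold must be greater than 1")

    ratio = sample_level / standard_level
    if ratio >= fold_threshold:
        return "higher"
    if ratio <= 1.0 / fold_threshold:
        return "lower"
    return "comparable"


# Example with made-up measurements (arbitrary units).
print(classify_expression(10.0, 4.0))
print(classify_expression(1.0, 4.0))
print(classify_expression(5.0, 4.0))
```

A ratio test of this kind only flags large differences; in practice a statistical test over replicate measurements would be needed before calling a difference significant.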

The tissue distribution and homology to ribosomal protein S15 of E. coli
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
diseases related to the assembly of ribosomes in the mitochondria, which is important in
the translation of RNA into protein. Therefore, this indicates that polynucleotides and
polypeptides corresponding to this gene are useful for diagnosis and intervention of
multiple tumors as well as in healing wounds, which are thought to be under similar
regulation as developmental tissues. The protein, as well as antibodies directed against the
protein, have utility as tumor markers, in addition to immunotherapy targets, for the
above listed tumors and tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 113 

The translation product of this gene shares sequence homology with human 
poliovirus receptor precursors which are thought to be important in viral binding and 
uptake. Preferred polypeptide fragments comprise the following amino acid sequence: 
ELSISISNVALADEGEYTCSIFTiVIPVRTAKSLVTVLGPQKPnTGYKSSLREKDT. 
ATLNCQSSGSKPAARLTWRKGDQELHGEPTKIQEDPNGKTFTVSSSVTFQVTR 
EDDGASIVCSVNHESLKGADRSTSQRmVLYTPT.AN[IRPDPPHPREGQKLLLHC 
EGRGNPVPQQYLWEKEGSWPL^ITQESALIFPFLNKSDSGTYGCTATSNMGS 
YKAYYTLNVND (SEQ ID NO:625). Also preferred are polynucleotide fragments
encoding these polypeptide fragments (See Accession No. gnl|PID|d1002627).

This gene is expressed almost exclusively in human brain tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, susceptibility to viral disease and diseases of the CNS, especially cancers
of that system. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the central nervous system, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO: 346 as residues: Leu-26 to Asp-37,
Lys-53 to Ser-59.

The tissue distribution and homology to poliovirus receptor precursors indicate
that polynucleotides and polypeptides corresponding to this gene are useful for the
treatment and prevention of diseases that involve the binding and uptake of virus
particles for infection. The protein might also be helpful in genetic therapy where the
goal is to insert foreign DNA into infected cells. With the help of this protein, the
binding and uptake of this foreign DNA might be aided. In addition, it is expected that
overexpression of this gene will indicate abnormalities involving the CNS, particularly
cancers of that system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 114 

The translation product of this gene shares sequence homology with
Y087_CAEEL hypothetical 28.5 KD protein ZK1236.7 in chromosome III of
Caenorhabditis elegans in addition to alpha-1 collagen type III (See Accession No.
gi|537432). One embodiment for this gene is the polypeptide fragment(s) comprising
the following amino acid sequence: VPELPDRVHQLHQAVQGCALGRPGFPGGPTH 
SGHHKSHPGPAGGDYNRCDRPGQVHLHNPRGTGRRGQLHPTAGPGVHRRA 
CPSQQLPHRLGPGVPCPSPSLTPVLPSWTQSWCG LPGYTSSS (SEQ ID 
NO:630). An additional embodiment is the polynucleotide fragment(s) encoding these
polypeptide fragments.

This gene is expressed primarily in brain cells and to a lesser extent in activated 
B and T cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurodegeneration and immunological disorders. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the neural and immune systems, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 347 as residues: Glu-34 to Glu-39, Gly-51 to Ser-72, Ala-88 to Glu-93, Gln-100
to Val-105.
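
Residue ranges written in this notation (a three-letter residue name and a 1-based position at each end) can be mapped onto a one-letter amino acid sequence mechanically. The following Python sketch shows one way to do so. The function names and the toy sequence are hypothetical and introduced only for illustration; the toy sequence is not taken from the sequence listing.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Matches ranges such as "Glu-34 to Glu-39".
RANGE_RE = re.compile(r"([A-Z][a-z]{2})-(\d+)\s+to\s+([A-Z][a-z]{2})-(\d+)")


def parse_epitope_ranges(text):
    """Return (start_residue, start_pos, end_residue, end_pos) tuples, 1-based."""
    return [(a, int(i), b, int(j)) for a, i, b, j in RANGE_RE.findall(text)]


def extract_epitope(sequence, start_res, start, end_res, end):
    """Slice a 1-based inclusive range and check that its end residues match."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("range lies outside the sequence")
    fragment = sequence[start - 1:end]
    if fragment[0] != THREE_TO_ONE[start_res] or fragment[-1] != THREE_TO_ONE[end_res]:
        raise ValueError("residue names do not match the sequence")
    return fragment


# Hypothetical toy sequence, NOT a sequence from the sequence listing.
toy_sequence = "MSTEAGKLPE"
toy_ranges = parse_epitope_ranges("Glu-4 to Gly-6, Lys-7 to Glu-10")
toy_fragments = [extract_epitope(toy_sequence, *r) for r in toy_ranges]
print(toy_fragments)
```

Checking the named residues against the sequence at both ends of each range guards against off-by-one errors and against applying a range to the wrong sequence.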

The tissue distribution and homology to Y087_CAEEL hypothetical 28.5 KD
protein ZK1236.7 in chromosome III of Caenorhabditis elegans as well as to a
conserved alpha-1 collagen type III protein indicate that polynucleotides and
polypeptides corresponding to this gene are useful for the detection and treatment of
neurodegenerative disease states and behavioral disorders such as Alzheimer's Disease,
Parkinson's Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia,
obsessive compulsive disorder and panic disorders. Because the gene is expressed in
cells of lymphoid origin, the natural gene product may be involved in immune
functions. Therefore it may also be used as an agent for immunological disorders
including arthritis, asthma, immune deficiency diseases such as AIDS, and leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 115 

The translation product of this gene shares sequence homology with alpha 3
type IX collagen, which is thought to be important in hyaline cartilage formation via its
role in the uptake of inorganic sulfate by cells (See Accession No. gi|975657). One
embodiment of this gene is the polypeptide fragment comprising the following amino
acid sequence: SLRRPRSAAXQTLTTFLSSVSSASSSALPGSREPCDPR.^PPPPR 
SGS.AASCCSCCCSCPRRR.\PLRSPRGSKRRIRQREVVDLYNGMCLQGPAGVPG 
RDGSPGANGIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNY 

GIDLGKIAECTFTmRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGP
LP1E.A^YLDQGSPEMNSTI^^HRTSSVEGLCEGIGAGLVDVAIWVGTCSDYPKG
D ASTG WNS V S RIHEELP K (SEQ ID NO:634). An additional embodiment is the
polynucleotide fragments encoding this polypeptide fragment.

This gene is expressed primarily in smooth muscle and to a lesser extent in 
synovial tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, dwarfism, spinal deformation, and specific joint abnormalities as well as 
chondrodysplasias, i.e., spondyloepiphyseal dysplasia congenita, familial osteoarthritis,
atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid, and autoimmune
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the skeletal system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to alpha 3 type IX collagen indicate that
polynucleotides and polypeptides corresponding to this gene are useful for the treatment
and diagnosis of diseases associated with mutation in this gene, which leads to the
many different types of chondrodysplasias. By the use of this product, the abnormal
growth and development of bones of the limbs and spine could be routinely detected or
treated in utero, since the protein or muteins thereof could affect epithelial cells early in
development and later the chondrocytes of the developing craniofacial structure.

FEATURES OF PROTEIN ENCODED BY GENE NO: 116 

The translation product of this gene shares sequence homology with retrovirus-related
reverse transcriptase, which is thought to be important in viral replication. One
embodiment for this gene is the polypeptide fragments comprising the following amino
acid sequence: TKKENCRPASLMNIDTK1LNKILMNQ (SEQ ID NO:640). An
additional embodiment is the polynucleotide fragments encoding these polypeptide
fragments (See Accession No. pir|A25313|GNHUL1).

This gene is expressed primarily in human meningima. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, retroviral diseases such as AIDS, and possibly certain cancers due to
transactivation of latent cell division genes. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to retrovirus-related reverse transcriptase
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
the detection and treatment of diseases and maladies associated with retroviral infection 
since a functional reverse transcriptase (RT) or RT-like molecule is an integral 
component of the retroviral life cycle. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 117 

The translation product of this gene shares sequence homology with an
unknown gene from C. elegans, as well as weak homology with mammalian metaxin, a
gene contiguous to both thrombospondin 3 and glucocerebrosidase that is known to be
required for embryonic development. Preferred polypeptide fragments comprise the
following amino acid sequence: MCNLPIKWCRANAEYMSPSGKVPXXHVGNQ
VVSELGPIVQFVKAKGHSLSDGLEEV 

DEATVGXITHXRYGSPYPWPLXHILAYQKQWEVKRKXK,^IGWGKKTLDQVLE 
DVDQCCQALSQRLGTQPYFFNKQPTELDALVFGHLYTILTTQLTNDELSEKVKN 
YSNLLAFCRRI EQHYFEDRGKGRLS (SEQ ID NO:641); MCNLPIKVVCRANAE 
YMSPSGKVPXXHVGNQVVSELGPIVQFVK (SEQ ID NO:642). Also preferred are
polynucleotide fragments encoding these polypeptide fragments (See Accession No.
gi|1326108).

This gene is expressed primarily in fetal tissues and to a lesser extent in 
hematopoietic cells and tissues, including spleen, monocytes, and T cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancer; lymphoproliferative disorders; inflammation; chondrosarcoma;
and Gaucher disease. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the hematopoietic and embryonic systems, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of cancer and other
proliferative disorders. Expression in embryonic tissue and other cellular sources
marked by proliferating cells indicates that this protein may play a role in the regulation
of cellular division. Additionally, the expression in hematopoietic cells and tissues
indicates that this protein may play a role in the proliferation, differentiation, and
survival of hematopoietic cell lineages. Thus, this gene may be useful in the treatment
of lymphoproliferative disorders, and in the maintenance and differentiation of various
hematopoietic lineages from early hematopoietic stem and committed progenitor cells.

FEATURES OF PROTEIN ENCODED BY GENE NO: 118 

The translation product of this gene shares sequence homology with reverse
transcriptase, which is important in the synthesis of a cDNA chain from an RNA
molecule and is the means whereby the infecting RNA chains of retroviruses are
transcribed into their DNA complements. One embodiment for this gene is the
polypeptide fragment comprising the following amino acid sequence:
iVLXXXNSHITIFTLNVNGLNAPNERilRL.ANWIQSQDQVCCIQETHLTGRDTHRL
KIKGWRKIYQANGKQKK (SEQ ID NO:647). An additional embodiment is the
polynucleotide fragments comprising polynucleotides encoding these polypeptide
fragments (See Accession No. gi|2072964).

This gene is expressed primarily in skin and to a lesser extent in neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancer; hematopoietic disorders; inflammation; and disorders of immune
surveillance. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the epidermis and/or hematopoietic system, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to reverse transcriptase indicate that
polynucleotides and polypeptides corresponding to this gene are useful for cancer
therapy. Expression in the skin also indicates that this gene is useful in wound healing
and fibrosis. Expression by neutrophils also indicates that this gene product plays a role
in inflammation and the control of immune surveillance (i.e., recognition of viral
pathogens). Reverse transcriptase family members are also useful in the detection and
treatment of AIDS.

FEATURES OF PROTEIN ENCODED BY GENE NO: 119 

The translation product of this gene shares sequence homology with reverse
transcriptase, which is important in the synthesis of a cDNA copy of an RNA molecule
and is the means whereby a retrovirus reverse-transcribes its genome into an inheritable
DNA copy.
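
The base-pairing rule that underlies synthesis of a cDNA copy from an RNA template can be sketched in Python as follows. This is an illustration of the complementarity rule only, not a model of the enzyme; the function name first_strand_cdna is an assumption introduced here, and the input is a made-up example.

```python
# DNA base laid down opposite each base of the RNA template (Watson-Crick pairing).
RNA_TO_DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def first_strand_cdna(rna):
    """Return the first-strand cDNA, written 5'->3', for an RNA written 5'->3'."""
    rna = rna.upper()
    try:
        complement = [RNA_TO_DNA_COMPLEMENT[base] for base in rna]
    except KeyError as exc:
        raise ValueError(f"not an RNA base: {exc.args[0]}") from None
    # The new strand is antiparallel to the template, so reverse the
    # complement to report it in the conventional 5'->3' direction.
    return "".join(reversed(complement))


# Made-up four-base template.
print(first_strand_cdna("AUGC"))
```

Reversing the complemented string reflects the antiparallel orientation of the two strands; without that step the result would be read 3' to 5'.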

This gene is expressed primarily in the frontal cortex of brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancer and neurodegenerative disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the CNS and peripheral nervous system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to reverse transcriptase suggest that this gene is
useful in the treatment of cancer and AIDS. The expression in brain indicates that it
plays a role in neurodegenerative disorders and in neural degeneration.

FEATURES OF PROTEIN ENCODED BY GENE NO: 120

One embodiment of this gene has homology to a hypothetical protein in 
Schizosaccharomyces pombe (See Accession No. 2281980). Another embodiment for
this gene is the polypeptide fragments comprising the following amino acid sequence:
IYHLHSWIFFHFKRAFCM
(SEQ ID NO:651). An additional embodiment is the polynucleotide fragments
encoding these polypeptide fragments. This gene maps to chromosome 18, and 
therefore, may be used as a marker in linkage analysis for chromosome 18. 

This gene is expressed primarily in adult hypothalamus and to a lesser extent in
infant brain.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, neurodegenerative disorders; endocrine function; and vertigo. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the brain, CNS and 
peripheral nervous system, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the treatment and diagnosis of
neurodegenerative disorders; diagnosis of tumors of a brain or neuronal origin;
treatments involving hormonal control of the entire body and of homeostasis,
behavioral disorders, such as Alzheimer's Disease, Parkinson's Disease, Huntington's
Disease, schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder and
panic disorder. In addition, the gene or gene product may also play a role in the
treatment and/or detection of developmental disorders associated with the developing
embryo.

FEATURES OF PROTEIN ENCODED BY GENE NO: 121 

The translation product of this gene shares sequence homology with the human
IRLB protein, which is thought to be important in binding to a c-myc promoter element
and thus regulating its transcription (See Accession No. gi|33969). This gene maps to
chromosome 1 and therefore may be used as a marker in linkage analysis for
chromosome 1.

This gene is expressed primarily in brain and breast and to a lesser extent in a 
variety of hematopoietic tissues and cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, cancer of the brain and breast; lymphoproliferative disorders; and
neurodegenerative diseases. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the CNS, breast, and immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 

15 fluid or spinal fluid) or another tissue or cell sample taken from an individual having 

such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the treatment and diagnosis of cancer of the
brain, breast, and hematopoietic system. In addition, it may be useful for the treatment
of neurodegenerative disorders, as well as disorders of the hematopoietic system,
including defects in immune competency and inflammation. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and
immunotherapy target for the above listed tumors and tissues.


FEATURES OF PROTEIN ENCODED BY GENE NO: 122 

The translation product of this gene shares sequence homology with an ATP
synthase, a key component of the proton channel that is thought to be important in the
translocation of protons across the membrane.

This gene is expressed primarily in T cell lymphoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, T cell lymphoma. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to ATP synthase indicate that
polynucleotides and polypeptides corresponding to this gene are useful for the treatment
of defects in proton transport, homeostasis, and metabolism, as well as the diagnosis
and treatment of lymphoma. Because the gene is expressed in cells of lymphoid origin,
the natural gene product may be involved in immune functions. Therefore it may also be
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, and leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 123 

This gene maps to chromosome 15, and therefore, may be used as a marker in
linkage analysis for chromosome 15.

This gene is expressed primarily in a variety of fetal tissues, including fetal 
liver, lung, and spleen, and to a lesser extent in a variety of blood cells, including 
eosinophils and T cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancer (abnormal cell proliferation); T cell lymphomas; and hematopoietic
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the fetus and immune system, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the treatment and diagnosis of conditions
involving cell proliferation. Expression of this gene in fetal tissues, as well as in a
variety of blood cell lineages, indicates that it may play a role in cellular
proliferation, apoptosis, or cell survival. Thus it may be useful in the management and
treatment of a variety of cancers and malignancies. In addition, its expression in blood
cells suggests that it may play additional roles in hematopoietic disorders and conditions,
and could be useful in treating diseases involving autoimmunity, immune modulation,
immune surveillance, and inflammation.


FEATURES OF PROTEIN ENCODED BY GENE NO: 124 

This gene is expressed primarily in placenta and to a lesser extent in pineal gland
and rhabdomyosarcoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, developmental, endocrine, and female reproductive disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the [insert system
where a related disease state is likely, e.g., immune], expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 357 as residues:
Leu-69 to Val-76.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of disorders in
development. The protein, as well as antibodies directed against the protein, may show
utility as a tumor marker and immunotherapy target for the above listed tumors and
tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 125

This gene is expressed primarily in benign prostatic hyperplasia. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of benign prostatic hyperplasia. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the reproductive
system, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of benign prostatic
hyperplasia. The protein, as well as antibodies directed against the protein, may show utility
as a tumor marker and immunotherapy target for the above listed tumors and tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 126 

This gene is expressed primarily in apoptotic T-cells and to a lesser extent in
suppressor T cells and ulcerative colitis.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases involving premature apoptosis, and immunological and
gastrointestinal disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of disorders involving
inappropriate levels of apoptosis, especially in immune cell lineages. Because the gene
is expressed in cells of lymphoid origin, the natural gene product may be involved in
immune functions. Therefore it may also be used as an agent for immunological
disorders including arthritis, asthma, immune deficiency diseases (such as AIDS), and
leukemia.



FEATURES OF PROTEIN ENCODED BY GENE NO: 127

This gene is expressed primarily in Raji cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, inflammation and T cell autoimmune disorders. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the immune system, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 360 as residues: Asp-23 to Gly-29.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of inflammation and T
cell autoimmune disorders. Because the gene is expressed in cells of lymphoid origin,
the natural gene product may be involved in immune functions. Therefore it may also be
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases (such as AIDS), and leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 128 

The translation product of this gene shares sequence homology with a
C. elegans coding region C47D12.2 of unknown function (See Accession No.
gnl|PID|e348986). One embodiment for this gene is the polypeptide fragments
comprising the following amino acid sequence: EDDGFNRSIHEVILKNrrWY
SERVLTEISLGSLLILVVIRT^
AAQRHSLFSLLSKKHNKVLEQAT
LEIINSCLTNSLHHNPNTVALLYKRDLFEQ
QAGS (SEQ ID NO:657); EDDGFNRSIHEVILKNITWYSERVLTEISLGSLLILVV
(SEQ ID NO:658); RTIQ YNMTRTRD KYLHTNCLAALANMS AQFRSLHQ Y AAQ
RIISLFSLLSKKHN (SEQ ID NO:659); KKHNKVLEQATQSLRGSLSSNDVPLPDY
AQD (SEQ ID NO:661); SCLTNSLHHNPNLVYALLYKRDLFEQFRTHPSFQD
DV1QNIDLVTSFFSSRLLQAGS (SEQ ID NO:660). An additional embodiment is the
polynucleotide fragments encoding these polypeptide fragments. This gene maps to
chromosome 13, and therefore, may be used as a marker in linkage analysis for
chromosome 18.

This gene is expressed primarily in smooth muscle and to a lesser extent in fetal
liver.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, atherosclerosis and other cardiovascular and hepatic disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the circulatory
system, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of circulatory system
disorders such as atherosclerosis, hypertension, and thrombosis. In addition, the tissue
distribution indicates that polynucleotides and polypeptides corresponding to this gene
are useful for the detection and treatment of liver disorders and cancers (e.g.,
hepatoblastoma, jaundice, hepatitis, liver metabolic diseases and conditions that are
attributable to the differentiation of hepatocyte progenitor cells). In addition, the
expression in fetus would suggest a useful role for the protein product in developmental
abnormalities, fetal deficiencies, pre-natal disorders and various wound-healing models
and/or tissue trauma.

FEATURES OF PROTEIN ENCODED BY GENE NO: 129

The translation product of this gene shares sequence homology with a ribosomal
protein which is thought to be important in cellular metabolism, in addition to the
C. elegans protein F40F11.1 which does not have a known function at the current time
(See Accession No. gnl|PID|e244552). Preferred polypeptide fragments comprise the
following amino acid sequence:
MADIQTERAYQKQPTIFQNKKRVLLGETGKEKLPRVTNKNIGLGFKDT
PRRLLRGTYIDKKCPFTGNVSIRGRILSGVVTQDEDAEDHCHPPRLSALHPQVQ
PLREAPQEHVCTPVPL LQGRPDR (SEQ ID NO:662); MKMQRTIVIRRDYLH
YIRKYNRFEKRHKNMSVHl^
AAGTKKQFQKF (SEQ ID NO:663); MADIQTERAYQKQPTIFQNKKRVLLGET
GK (SEQ ID NO:664); HCHPPRLSALHPQVQPLREAPQEHVCTPVPL LQGRPDR
(SEQ ID NO:666); NIGLGFKDTPRRLLRGTYIDKKCPFTGNVSIRGRILSGVVTQ
(SEQ ID NO:669); MICMQRTIVIRRDYLHYIRKYNRPEKRHKNMSVHLSP (SEQ
ID NO:667); CFRDVQIGDIVTVGECRPLSKTVRFNVLKVTIC^AGTKKQFQKF
(SEQ ID NO:668). Also preferred are polynucleotide fragments encoding these
polypeptide fragments.

This gene is expressed primarily in Wilms' tumor and to a lesser extent in
thymus and stromal cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases affecting RNA translation. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of Wilms' tumors, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 362 as residues:
Thr-11 to Asp-20.

The tissue distribution and homology to a ribosomal protein indicate that
polynucleotides and polypeptides corresponding to this gene are useful for diseases
affecting RNA translation.

FEATURES OF PROTEIN ENCODED BY GENE NO: 130

The translation product of this gene shares sequence homology with a yeast
DNA helicase which is thought to be important in global transcriptional regulation (See
Accession No. gnl|PID|e243594). One embodiment for this gene is the polypeptide
fragments comprising the following amino acid sequence: IFYDSDWNPTVDQQA
MDRAHRLGQTKQVTVYRLICKGTE (SEQ ID
NO:670); TRMIDLLEEYMVYRKHTYXR^
FVFLLSTRAGGLGINLTAXDTVHF (SEQ ID NO:671); TRMIDLLEEYMVYRK
HTYXRLDGSSKISERRDM (SEQ ID NO:674); RRDMV ADFQNRNDIFVFLL
STRAGGLGINLTAXDTYHF (SEQ ID NO:675); JFYDS D WNPTVDQQ AMD
RAHRLGQTKQVTVYRLICKG (SEQ ID NO:676); RLICKGTDEERILQRAK
EKSEIQRMVISG (SEQ ID NO:678). An additional embodiment is the polynucleotide
fragments encoding these polypeptide fragments.

This gene is expressed primarily in amygdala.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases and disorders of the brain. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the central nervous system, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to a DNA helicase indicate that
polynucleotides and polypeptides corresponding to this gene are useful for diseases
affecting RNA transcription, particularly developmental disorders and healing wounds,
since the latter are thought to approximate developmental transcriptional regulation.

FEATURES OF PROTEIN ENCODED BY GENE NO: 131 

This gene is expressed primarily in prostate and to a lesser extent in amygdala 
and pancreatic tumors. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, prostate enlargement and gastrointestinal disorders, particularly of the 
pancreas and gall bladder. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the reproductive system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and diagnosis of prostate diseases,
including benign prostatic hyperplasia and prostate cancer. In addition, the tissue
distribution in tumors of the pancreas indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and intervention of these tumors, in
addition to other tissues where expression has been indicated. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tumors and tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 132 

This gene is expressed primarily in adult lung and to a lesser extent in 
hypothalamus. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, pulmonary diseases and neurological disorders. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the pulmonary and respiratory
systems, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of pulmonary and
respiratory disorders such as emphysema, pneumonia, and pulmonary edema and
emboli. In addition, the tissue distribution indicates that polynucleotides and
polypeptides corresponding to this gene are useful for the detection/treatment of
neurodegenerative disease states and behavioral disorders such as Alzheimer's Disease,
Parkinson's Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia,
obsessive compulsive disorder and panic disorder. In addition, the gene or gene
product may also play a role in the treatment and/or detection of developmental
disorders associated with the developing embryo, sexually-linked disorders, or
disorders of the cardiovascular system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 133

This gene is expressed primarily in human liver.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cirrhosis of the liver and other hepatic disorders. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the digestive system, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of liver disorders such
as cirrhosis, jaundice, and hepatitis. The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the above
listed tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 134

This gene is expressed primarily in fetal kidney and to a lesser extent in fetal 
liver and spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, development and regeneration of liver and kidney and immunological
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the digestive and excretory systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 367 as residues: Pro-70 to Arg-77,
Tyr-102 to Thr-107.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of diseases of the
kidney and liver such as cirrhosis, kidney failure, kidney stones, and liver failure,
hepatoblastoma, jaundice, hepatitis, liver metabolic diseases and conditions that are
attributable to the differentiation of hepatocyte progenitor cells. In addition, the
expression in fetus would suggest a useful role for the protein product in developmental
abnormalities, fetal deficiencies, pre-natal disorders and various wound-healing models
and/or tissue trauma.



FEATURES OF PROTEIN ENCODED BY GENE NO: 135

This gene is expressed primarily in brain, bone marrow, and to a lesser extent in 
placenta, T cell, testis and neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurodegenerative and immunological diseases and cancer. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the nervous and
immune systems, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 368 as residues: Met-1 to His-6.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the detection/treatment of neurodegenerative
disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's
Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia, obsessive
compulsive disorder and panic disorder. In addition, the gene or gene product may also
play a role in the treatment and/or detection of developmental disorders associated with
the developing embryo, or sexually-linked disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 136

The translation product of this gene is homologous to the human WD repeat
protein HAN11. Preferred polypeptide fragments comprise the following amino acid
sequence:
MSLHGKRKEIYKYEAPWTVYAMNWSVRP
LDEESSEHCRNTFDHPYPTTKLiVrW'PDTKGVYPDLLATSGDYLRVWVGETET
RLECLLNNNKNSDFC.APLTSFDWNEVDPYLLGTSSIDTTCTTWGLETGQVLGRV
NXVSGHVKTQLLAHDKEVYT)L\FSR,\GGGRDiVIFASVGADGSVR.MFDLRHLEH
STIIYIEDPQHHPLLRLCWKQDPNYLATNL^lDGMEVVILDVRVPAHLXPGTTm
H V S M ALLGPHIHP ATS ALQRMTTRLS S GTSS KCPEPLRTLS WPTQLXGEINN V Q
WASTQPELSPSATTTAWRYSECSVGGAVPTRQGLLYFLPLPHPQS (SEQ ID
NO:679); MSLHGKRKEIYKYTEAPWTVYAMNW^
EEYNNKVQLVGLDEESSEFICRNTFDHPYPTTKL.VIWIPDTKGVYPDLLATSGDY
LRVWRVGETETRLECLLNNNKNSDFCAPLTSFDWNEVDPYLL (SEQ ID
NO:680); SFDWNEVDPYLLGTSSIDTTCTIWGLETGQVLGRVNLVSGHVK
TQLIAHDKEVYDLAFSR^GGGRDMFASVGADGSVRN^
HPLLRLCWNKQDPN^TATMAiMDG^VVILDVRVPAHLXPGTTI (SEQ ID
NO:681); VG.AI^GSVILVIFDLRHLEHSTIIY^DPQHHPLLRLCWNKQDPNYLA
TMAMDGiVffiVVILDVRVPAHLXPGTTIEHV^^^
SGTSSKCPEPLRTLSWPTQLXGEINNVQWASTQPELSPSATTTAWRYSECSVG
G A VPTRQGLL YFLPLPHPQS (SEQ ID NO:682). Also preferred are polynucleotide
fragments encoding these polypeptide fragments.

This gene is expressed primarily in placenta, embryo, T cell and fetal lung and 
to a lesser extent in endothelial, tonsil and bone marrow. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immunological and developmental diseases in addition to cancers.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 369 as residues: Gly-19 to Gln-28, Pro-36 to Phe-42.

The tissue distribution in tumors of colon, ovary, and breast origins indicates
that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis and intervention of these tumors, in addition to other tumors where
expression has been indicated. The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the above
listed tumors and tissues. Because the gene is expressed in cells of lymphoid origin, the
natural gene product may be involved in immune functions. Therefore, it may also be
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, and leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 137

This gene is expressed primarily in TNF and INF induced epithelial cells, T 
cells and kidney.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, inflammatory conditions, particularly inflammatory reactions in the
kidney. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the renal
system, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 370 as residues: Thr-67 to Gly-72, Gln-132 to Ala-145,
Arg-150 to Pro-157.
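
Epitope ranges of the form "Thr-67 to Gly-72" pair a three-letter residue name with a 1-based position in the referenced sequence. The following is a minimal sketch, not part of the original disclosure, of how such ranges might be parsed and used to slice a one-letter amino acid sequence. The helper names `parse_epitope_ranges` and `extract_epitope` are hypothetical, and the short sequence in the usage note is a toy example rather than any sequence of the invention.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Matches ranges such as "Thr-67 to Gly-72".
RANGE_RE = re.compile(r"([A-Z][a-z]{2})-(\d+)\s+to\s+([A-Z][a-z]{2})-(\d+)")


def parse_epitope_ranges(text):
    """Return (start_residue, start_pos, end_residue, end_pos) tuples."""
    return [(a, int(i), b, int(j)) for a, i, b, j in RANGE_RE.findall(text)]


def extract_epitope(sequence, rng):
    """Slice a one-letter sequence by a 1-based inclusive range.

    Raises ValueError if the range falls outside the sequence or if the
    boundary residues do not match the three-letter names in the range.
    """
    start_res, start, end_res, end = rng
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("range lies outside the sequence")
    fragment = sequence[start - 1:end]
    if (fragment[0] != THREE_TO_ONE[start_res]
            or fragment[-1] != THREE_TO_ONE[end_res]):
        raise ValueError("boundary residues do not match the sequence")
    return fragment
```

For example, `parse_epitope_ranges("Thr-67 to Gly-72, Gln-132 to Ala-145")` yields two tuples, and `extract_epitope("MGAAAQ", ("Gly", 2, "Gln", 6))` returns the toy fragment `"GAAAQ"`. The boundary check is a design choice intended to catch numbering mismatches between a residue list and the sequence it refers to.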

The tissue distribution indicates that the protein products of this gene are useful
for treating the damage caused by inflammation of the kidney.

FEATURES OF PROTEIN ENCODED BY GENE NO: 138

This gene maps to chromosome 1 and, therefore, may be used as a marker in
linkage analysis for chromosome 1 (See Accession No. D63485).

This gene is expressed primarily in breast cancer and colon cancer and to a
lesser extent in thymus and fetal spleen.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancers, especially of the breast and colon tissues. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.
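
The comparison of a sample's expression level against the standard gene expression level can be illustrated with a short sketch. This is only an illustration under stated assumptions: the text does not define what counts as "significantly" higher or lower, so the fold-change cutoff `fold_threshold` and the function name `compare_to_standard` are hypothetical and not part of the original disclosure.

```python
import math


def compare_to_standard(sample_level, standard_level, fold_threshold=2.0):
    """Classify a sample expression level relative to a standard level.

    The standard level is the expression level measured in healthy tissue
    or bodily fluid. fold_threshold is an assumed, illustrative cutoff;
    the source text does not specify one.
    Returns "higher", "lower", or "unchanged".
    """
    if sample_level <= 0 or standard_level <= 0:
        raise ValueError("expression levels must be positive")
    if fold_threshold <= 1:
        raise ValueError("fold_threshold must be greater than 1")
    # Work on a log2 scale so that up- and down-regulation are symmetric.
    log2_fold_change = math.log2(sample_level / standard_level)
    cutoff = math.log2(fold_threshold)
    if log2_fold_change >= cutoff:
        return "higher"
    if log2_fold_change <= -cutoff:
        return "lower"
    return "unchanged"
```

With the assumed two-fold cutoff, `compare_to_standard(8.0, 2.0)` returns `"higher"` and `compare_to_standard(1.0, 4.0)` returns `"lower"`. In practice a statistical test over replicate measurements would normally replace a fixed cutoff.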

The tissue distribution in tumors of colon and breast origins indicates that
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis
and intervention of these tumors, in addition to other tumors where expression has been
indicated. The protein, as well as antibodies directed against the protein, may show utility as
a tumor marker and/or immunotherapy target for the above listed tumors and tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 139

This gene maps to chromosome 17 and, therefore, can be used as a marker in
linkage analysis for chromosome 17.

This gene is expressed primarily in CD34 positive cells, and to a lesser extent in
activated T-cells and neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immunologically related diseases and hematopoietic disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune system
and hematopoietic system, expression of this gene at significantly higher or lower levels



WO 98/54963 PCT/US98/11422

may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution in CD34 positive cells, T-cells and neutrophils indicates that
polynucleotides and polypeptides corresponding to this gene are useful for treatment
and diagnosis of hematopoietic disorders and immunologically related diseases, such as
anemia, leukemia, inflammation, infection, allergy, immunodeficiency disorders,
arthritis, asthma, and immune deficiency diseases such as AIDS.

FEATURES OF PROTEIN ENCODED BY GENE NO: 140 

This gene was recently cloned by another group, who called the gene
KIAA0313. (See Accession No. dl021609.) Preferred polypeptide fragments
comprise the amino acid sequence:

LYATATVISSPSTEXLSQDQGDRASLDAADSGRGSWTSCSSGSHDNIQTIQ
HQRSWETLPFGHTHFDYSGDPAGLWASSSHMDQIMFSDHSTKYNRQNQSRES
LEQAQSRASWASSTGYWGEDSEGDTGTIKRRGGKDVSIEAESSSLTSYTTEETK
PVPMPAHIAVASSTTKGLIARKEGRYREPPPTPPGYIGIPITDFPEGHSHPARKP
PDYNVALQRSRMVARSSDTAGPSSVQQPHGHPTSSRPVNKPQWHKXNESDPR
LAPYQSQGFSTEEDEDEQVSAV (SEQ ID NO:683); HMDQIMFSDHSTKYNRQ
NQSRESLEQAQSRASWASSTGYWGE (SEQ ID NO:684); SVTTEETKPVPMP
AHIAVASSTTKGLIARKEGRYREPPPTPPGYIGIPITD (SEQ ID NO:685); and
VALQRSRMVARSSDTAGPSSVQQPHGHPTSSRPVNKPQW
HKXNESDPRLAPYQSQGF (SEQ ID NO:686). Also preferred are polynucleotide
fragments encoding these polypeptide fragments. This gene maps to chromosome 4
and, therefore, may be used as a marker in linkage analysis for chromosome 4 (See
Accession No. AB002311).

This gene is expressed primarily in ovarian cancer and in tumors of the testis, brain,
and colon.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, ovarian, testicular, brain and colon cancers. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the male and female reproductive systems,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution in tumors of colon, ovary, testis, and brain origins
indicates that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis and intervention of these tumors, in addition to other tumors where
expression has been indicated. The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the above
listed tumors and tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 141 

This gene is expressed primarily in spleen and colon cancer.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, colon cancer and immunological disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the gastrointestinal tract and immune
systems, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g.,
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution in tumors of colon, ovary, and breast origins indicates
that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis and intervention of these tumors, in addition to other tumors where
expression has been indicated. The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the above
listed tumors and tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 142

The translation product of this gene is homologous to T cell translocation protein, a putative zinc
finger factor (See Accession No. 340454), as well as to the G-protein coupled receptor
TM5 consensus polypeptide (See Accession No. R50734). Preferred polypeptide
fragments comprise the following amino acid sequence:

CLLFVFVSLGiMRCLFWTIVWVLYXKHKCNTVLLCY-HLCSI (SEQ ID NO:687);
ACSKLIPAFEMVMR.^KDNVYHLDCFACQLCNQRXCVGDKFFLKNNXXLCQT
DYEEGLMKEGYAPXVR (SEQ ID NO:688). Also preferred are polynucleotide
fragments encoding these polypeptide fragments.

This gene is expressed primarily in fetal brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurological disorders including brain cancer. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the Central Nervous System,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the detection/treatment of neurodegenerative
disease states and behavioral disorders such as Alzheimer's Disease, Parkinson's
Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia, obsessive
compulsive disorder and panic disorder. In addition, the gene or gene product may also
play a role in the treatment and/or detection of developmental disorders associated with
the developing embryo.

FEATURES OF PROTEIN ENCODED BY GENE NO: 143 

The translation product of this gene has significant homology to the Fas ligand,
which is a cysteine-rich type II transmembrane protein/tumor necrosis factor receptor
homolog. Mutations within this protein have been shown to result in generalized
lymphoproliferative disease leading to the development of lymphadenopathy and
autoimmune disease (See Medline Article No. 94135175). Preferred polypeptide
fragments comprise the following amino acid sequence:

SALSEPGAPDRRRPCPESVPRRPDDEQWPPPTALCLDVAPLPPSS (SEQ ID
NO:689). Also preferred are polynucleotide fragments encoding these polypeptide
fragments (See Accession No. 473565).

This gene is expressed primarily in osteoblasts, lung, and brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, osteoblast-related, pulmonary, neurological, and immunological
diseases. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the skeletal and nervous systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 376 as residues: Trp-33 to Thr-40,
Lys-45 to Ile-63.

The tissue distribution in osteoblasts, lung, and brain combined with its
homology to the Fas ligand indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and intervention of these tumors, in
addition to other tumors where expression has been indicated. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tumors and tissues. Because the Fas ligand
gene is known to be expressed in cells of lymphoid origin, the natural gene product
may be involved in immune functions. Therefore, it may also be used as an agent for
immunological disorders including asthma, immune deficiency diseases such as AIDS
and leukemia, and various autoimmune disorders including lupus and arthritis.

FEATURES OF PROTEIN ENCODED BY GENE NO: 144 

This gene shares sequence homology with a 21.5 KD transmembrane protein in
the SEC15-SAP4 intergenic region of yeast. (See Accession No. 1723971.) Preferred
polypeptide fragments comprise the amino acid sequence:

AHASESGER\V^^^CCGVRFGLRSEAJGRSCCHDGPGGLVANRGRRFKWAIEL
SGPGGGSRGRSDRGSGQGDSL^TVGYLDKQVPDTSVQETDRILVEKRCWDLAL
GPLKQIPMNLFMYMA

KFLQGLVYLIGNLMGLALAVYKCQSMGLLPTHASDWLAFIEPPERiMEFSGG
GLLL (SEQ ID NO:691); PVGYLDKQVPDTSVQETDRELVEKRCWDL^LGPLKQ
IPMNLF1 (SEQ ID NO:693); and ATFKMLESSSQKFLQGLVYLIGNLMGLALAV
YKCQSMGLLPTHASD (SEQ ID NO:692). Also preferred are polynucleotide
fragments encoding these polypeptide fragments.

This gene is expressed primarily in osteoclastoma, hemangiopericytoma, liver, and
lung.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, osteoclastoma, hemangiopericytoma, liver and lung tumors. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the above tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the lung
and liver systems, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosing osteoclastoma, 
hemangiopericytoma, liver and lung tumors. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 145 

The translation product of this gene shares homology with the glucagon-69 gene,
which may indicate this gene plays a role in regulating metabolism (See Accession No.
A603 IS). One embodiment for this gene is the polypeptide fragments comprising the
following amino acid sequence:

PTTKIDIMEKKKHIQI^^ (SEQ ID
NO:694). An additional embodiment is the polynucleotide fragments encoding these
polypeptide fragments.

This gene is expressed primarily in brain, kidney, colon, and testis.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, brain, kidney, colon, and testicular cancer. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the male reproductive, neurological,
circulatory, and gastrointestinal systems, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution in tumors of brain, kidney, colon, and testis origins
indicates that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis and intervention of these tumors, in addition to other tumors where
expression has been indicated. The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the above
listed tumors and tissues. The tissue distribution indicates that polynucleotides and
polypeptides corresponding to this gene are useful for the detection/treatment of
neurodegenerative disease states and behavioral disorders such as Alzheimer's Disease,
Parkinson's Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia,
obsessive compulsive disorder and panic disorder. In addition, the gene or gene
product may also play a role in the treatment and/or detection of developmental
disorders associated with the developing embryo, sexually-linked disorders, or
disorders of the cardiovascular system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 146

The translation product of this gene shares sequence homology with goliath
protein, which is thought to be important in the regulation of gene expression during
development. The protein may serve as a transcription factor. One embodiment for this gene
is the polypeptide fragments comprising the following amino acid sequence:

TEHIL\VMrTELRGKDILSYL£KNlSVQMTIAVGTRiViPPKNFSRGSLVFVSISnV
LMBS S A WLIFYFIQ KIR YTN ARDRNQ RRLGD AAKKAIS KLTTRTV KKG D KETD 
PDFDHCAVCIESYKQNDVVFdLPCKHVFHKSCVDPWLSEHCTCPMCKLNILKA 
LGIV (SEQ ID NO:695); TEHH\VMrTELRGKDILS YLEKNIS VQMT1AVGTRMP
PKNFSRGSLVFVSISFIVLM IISSAWLIFYF (SEQ ID NO:697); SISFIVLMDSSA 

WLIFYHQKIRYTNARDRNQRRLGDA^KKAtS KLTTRTV KKGDKE (SEQ ID
NO:698); VKKGDKETDPDFDHCAVCIESYKQNDVVRILPCKHVFHKSCVDP
WLSEHCTCPMCKLNILKALGIV (SEQ ID NO:699). An additional embodiment is
the polynucleotide fragments encoding these polypeptide fragments (See Accession No.
157535). Moreover, another embodiment is the polynucleotide fragments encoding
these polypeptide fragments:
MTHPGTEHIIAVMITELRGKDILSYLEKNISVQMTIAVGTRMPPKNFSRGS

LVFVSISHVLMnSSAWLIFltTIQKJR^TNARDRNQRRLGDAAKlC\ISKLTTRPv
KKGDKETDPDFDHCAVCIESYKQNDVVRILPCKHVFHKSCVDPWLSEHCTCP
MCKL^Kj^GIVPNLPCTD^AFD^RLTRTQAVNRRSALGDLAGDNSLGLE
PLRTSGISPLPQDGELTPRTGEINIAVTKEWFIIASFGLLSALTLCYMIIRATASLN
ANEVEWF (SEQ ID NO:696); MTHPGTEHHA VMITELRG KDELS YLEKNIS V QM
TlAVGTRiMPPKNFSRGSLVFVSISFIVLMIISSAWLIFYFIQKIRYTN.ARDRNQRR 
L G D A AKKAIS KLTTRT (SEQ ID NO:700); AAKKAISKLTTRTVKKGDKE 
TDPDFDHCAVCIESYKQNDVVRILPCKHVFHKSCVDPWLSEHCTCPMCKLNIL 
KALGIVPNLPC (SEQ ID NO:701); TQAVNRRSALGDLAGDNSLGLEPLRTSGI 

SPLPQDGELTPRTGEINIAVTKE^TIIASFGLLS ALTLC^TVIIIRATASLNANEVEW
F (SEQ ID NO:702); PLHGVADHLGCDPQTRFFVPPNIKQWIALLQRGNCTF 
KEKIS RA AFHN A V A V VI YNNKS KEEP VTMTHPGTEHII A VMrrELRGKDILS YLE 
KNISVQMTIAVGTRMPPKNFSRGSLVFVSISHVLMIISSAWLIFYF1QKIRYTNA 
RDRNQRRLGDAAKKAISKLTTRTVKKGDKETDPDFDHCAVCIESYKQNDVVRI 

LPCKHVFHKSCVDPWLSEHCTCPMCKLNILK.ALGIVPNLPCTDNVAFDMERLT
RTQAVNRRSALGDLAGDNSLGLEPLRTSGISPLPQDGELTPRTGEINIAVTKEW 
FIIASFGLLS ALTLC YMIIRAT AS LN ANE VEWF (SEQ ID NO:703); and

HGVADHLGa^PQTRFFVPPNIKQVVIALLQRGNCTFKEKISRAAFHNAVAWIY 
NNKSKEE (SEQ ID NO:704). An additional embodiment is the polynucleotide
fragments encoding these polypeptide fragments. When tested against Jurkat cell lines,
supernatants removed from cells containing this gene activated the GAS pathway.
Thus, it is likely that this gene activates immune cells through the JAKS/STAT signal
transduction pathway.

This gene is expressed primarily in macrophage, breast, kidney and to a lesser
extent in synovium, hypothalamus and rhabdomyosarcoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, schizophrenia and cancer. Similarly, polypeptides and antibodies directed
to these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune and neural systems, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to zinc finger protein indicate that
polynucleotides and polypeptides corresponding to this gene are useful for the treatment
of schizophrenia, kidney disease and other cancers. The tissue distribution in
macrophage, breast, and kidney origins indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and intervention of tumors within
these tissues, in addition to other tumors where expression has been indicated. The protein,
as well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above listed tumors and tissues. Because the gene
is expressed in cells of lymphoid origin, the natural gene product may be involved in
immune functions. Therefore, it may also be used as an agent for immunological
disorders including arthritis, asthma, immune deficiency diseases such as AIDS, and
leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 147 

The translation product of this gene shares sequence homology with HNP36
protein, an equilibrative nucleoside transporter, which is thought to be important in
gene transcription as well as serving as an important component of the nucleoside
transport apparatus (See Accession No. 1845345). One embodiment for this gene is
the polypeptide fragments comprising the following amino acid sequence:

MSGQGLAGFFASVAiMICALASGSELSES.AFGWrTACAVnLTIICYLGLPRLEFYR
YYQQLKLEGPGEQETKLDLISKGEEPFLAGKEESGVSVSNSQPTNESHSIKAILK 
NISVLAFSVCFIFriTIGWPAVTVEVKSSlAGSSTWERYFIPVSCFLTFNIFDWLG 
RSLTAVFMWPGKDSRWLPSWXLARLVFVPLLLL^ 
FFMAAFAFSNGYLASLCMCFGPKXVKPAE^ 

PSCSGQLCDKGWTEGLPASLPVCLLPLPS ARGDPEWSGGFFF (SEQ ID
NO:705); MSGQGLAGFFASVAiVnCMASGSELSES.AFGWITACAVIILTnC
YLGLPRLEFYRYYQQLKLE GPGEQETKLDLISKGEEPRAGKEESGVSVSNSQ 
PTNESHSI (SEQ ID NO:706); SGVSVSNS QPTNES HS IKAIL KNI S VL AFS V CFI 
FTITIGMFPAVTVEVKSSIAGSSTWERYFIPVSCFLTFNIFDVVLGRS (SEQ ID 

NO:707); TIGMFPAVTVEVKSSIAGSSW

iWPGKDSRWLPSWXL 4 ARLVFVPLLLLCNIK PRRYLTWFEHD A (SEQ ID
NO:708); FGPKKVKPAEAETAEPSWPSSCVWVWHWGLFSPSCSGQLCDK
GWTEGLPASLPVGLLPLPS ARGDPEWSGGFFF (SEQ ID NO:709). An additional
embodiment is the polynucleotide fragments encoding these polypeptide fragments.

This gene is expressed primarily in eosinophils and aortic endothelium and to a
lesser extent in umbilical vein endothelial cells and thymus.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, hematopoietic disease. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the circulatory system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to HNP36 protein indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the treatment 
of blood neoplasias and other hematopoietic disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 148 

This gene is expressed primarily in breast cancer cell lines, thymus stromal 
cells, and ovary. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, endocrine and female reproductive system diseases including breast
cancer. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
endocrine system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and treatment of endocrine
disorders. In addition, the tissue distribution in tumors of thymus, ovary, and breast
origins indicates that polynucleotides and polypeptides corresponding to this gene are
useful for diagnosis and intervention of these tumors, in addition to other tumors where
expression has been indicated. The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the above
listed tumors and tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 149

The translation product of this gene has homology to pmt1 and pmt2, two
conserved Schizosaccharomyces pombe genes. One embodiment for this gene is the
polypeptide fragments comprising the following amino acid sequence:
DDDGFHVPEDPAKHRILDPEGLALGAV

ELPEWFVQEEKQHR1RQLPVGKKEVEHYRKRWREINARPIXXXXXXXXXXX
XXXXXXLEQTRKKAEA VVNTVDIXRTRES (SEQ ID NO:710);
DDDGmVPIEDPAKJlRILDPEG^ALGAVlASSKK,^KRJDLIDNSFNR^^ (SEQ
ID NO:711); KRWREINARPIXXXXXXXXXXXXXXXXXLEQTRKKAE
A VVNTVDIXRTRES (SEQ ID NO:712). An additional embodiment is the
polynucleotide fragments encoding these polypeptide fragments (See Accession No.
e!216734).

This gene is expressed primarily in retina and ovary and to a lesser extent in
breast cancer cells, epididymis and osteosarcoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neuronal growth disorders, cancer and reproductive system disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
neural and reproductive systems, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 382 as residues: Met-1 to Gly-7.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis or treatment of reproductive
system disease and cancers.
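
The comparison of a sample's expression level against the standard gene expression
level, as described above, can be illustrated with a short sketch. This is an illustration
only and not part of the disclosed method: the function name classify_expression, the
default fold-change cutoff of 2.0, and the numeric inputs are hypothetical assumptions
introduced for the example.

```python
def classify_expression(sample_level: float, standard_level: float,
                        fold_threshold: float = 2.0) -> str:
    """Compare a sample's expression level with the standard level.

    standard_level is the level measured in healthy tissue or bodily fluid.
    fold_threshold is an assumed cutoff for "significantly" higher or lower;
    the source text does not specify a numeric value.
    """
    if sample_level < 0 or standard_level <= 0:
        raise ValueError("sample must be non-negative and standard must be positive")
    if fold_threshold <= 1:
        raise ValueError("fold_threshold must be greater than 1")

    ratio = sample_level / standard_level
    if ratio >= fold_threshold:
        return "higher"
    if ratio <= 1.0 / fold_threshold:
        return "lower"
    return "unchanged"
```

Under these assumptions, a sample measured at 10.0 units against a standard of 2.0 units
would be reported as "higher", and one at 1.0 unit against a standard of 4.0 units as "lower".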

FEATURES OF PROTEIN ENCODED BY GENE NO: 150

One embodiment for this gene is the polypeptide fragments comprising the following
amino acid sequence:

MIKDKGRART^TSSQPAHLCPENPLLHLK,^AVKEKKRNKKKKTIGSPKRIQS
PLNNKLLNSPAKTLPGACGSPQK1-IDGFLKHEGPPAEKPLEELSASTSGVPGLS
SLQSDPAGCVRPPAP^AGAVEFNDVKTL^
LIEEKDLEKLDLVIKYM^
VT (SEQ ID NO:713); MIKDKGRART AJLTSS QP AHLCPENPLLHLKAA VKE
KKRNKKKKTIGSPKRIQ (SEQ ID NO:714); KRIQS PLNNKLL NS P A KT
LPGACGSPQmDGFLKHEGPPAEKPLEELSASTSGVPGLSSLQSDPAGCVRPP
APNLAGAVEFNDVKTLLREWTTTISDPM (SEQ ID NO:715);
T1SDPNEEDILQWKYCTDLIEEKDLEKI.DLVIKYN1KRLMQQSVE
SVWN^IAFDFILDNVQVVLQQT'YGSTLKVT (SEQ ID NO:716). An additional
embodiment is the polynucleotide fragments encoding these polypeptide fragments.

This gene is expressed primarily in 12 week embryo and to a lesser extent in
hemangiopericytoma and frontal cortex.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, growth disorders and hemangiopericytoma. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the circulatory and neural systems, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 383 as residues: Leu-4 to Lys-11.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the treatment of growth disorders,
hemangiopericytoma and other soft tissue tumors.

FEATURES OF PROTEIN ENCODED BY GENE NO: 151

The translation product of this gene has been found to have homology to a
human DNA mismatch repair protein, PMS3. Preferred polypeptide fragments comprise
the following amino acid sequence: FCHDCKFPEASPAMNCEP (SEQ ID NO:717).
Also preferred are polynucleotide fragments encoding these polypeptide fragments (See
Accession No. R95250).

This gene is expressed primarily in neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, lymphoma, immunodeficiency diseases, and cancers resulting from
genetic instability. Similarly, polypeptides and antibodies directed to these polypeptides
are useful in providing immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 384 as residues: Met-1 to Lys-6.

The tissue distribution in neutrophils and the sequence homology indicate that
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis of
Hodgkin's lymphoma, since the elevated expression and secretion by the tumor mass
may be indicative of tumors of this type. Additionally, the gene product may be used as
a target in the immunotherapy of the cancer. Because the gene is expressed in cells of
lymphoid origin, the natural gene product may be involved in immune functions.
Therefore, it may also be used as an agent for immunological disorders including
arthritis, asthma, immune deficiency diseases such as AIDS, and leukemia.
Furthermore, its homology to a known DNA repair protein suggests that the gene may be
useful in establishing cancer predisposition and prevention in gene therapy applications.

FEATURES OF PROTEIN ENCODED BY GENE NO: 152

This gene is expressed primarily in neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, infectious diseases and lymphoma. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment of inflammation and infectious 
diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 153

One embodiment for this gene is the polypeptide fragments comprising the
following amino acid sequence:

MASSVPAGGHTRAGGIFLIGKLDLEASLFKSFQWLPFVLRKKC
NFFCWDSSAHSLPLHPLSASCSAPACHASDTHLLYPSTRALCPSIFAWLVAPHS
VFRTNAPGPTPSSQSSPVFPVFPVSFMALIVCXLVCC (SEQ ID NO:720);
MASSVPAGGHTRAGGIFLIGKLDLEASLFKSFQWLPFVLRKKCNFFCWDSSAH
SLPLHPLSASCSAPACHA (SEQ ID NO:721); FAWLVAPHSVFRTNAPGPTPS
SQSSPVFPVFPVSFMALIVCXLVCC (SEQ ID NO:722). An additional embodiment
is the polynucleotide fragments encoding these polypeptide fragments.

This gene is expressed primarily in neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, inflammation and infectious disease. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 386 as residues:
Ser-11 to Pro-17.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the treatment of infectious diseases and
inflammation.

FEATURES OF PROTEIN ENCODED BY GENE NO: 154

This gene is expressed in multiple tissues including ovary, uterus, adipose
tissue, brain, and the liver.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, uterine, ovarian, brain, and liver cancer. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the female reproductive system, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution of this gene indicates that polynucleotides and
polypeptides corresponding to this gene are useful for diagnostic or therapeutic uses in
the treatment of disorders of the female reproductive system, obesity, and liver disorders,
particularly cancer in the above tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 155 

This gene maps to chromosome 3, and therefore, may be used as a marker in
linkage analysis for chromosome 3 (See Accession No. D87452).

This gene is expressed in multiple tissues including brain, aortic endothelial
cells, smooth muscle, pituitary, testis, melanocytes, spleen, neutrophils, and placenta.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immunological disorders including immunodeficiencies, cancers of the
brain and the female reproductive system, as well as cardiovascular disorders, such as
atherosclerosis and stroke. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the central nervous and immune systems, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution suggests that polynucleotides and polypeptides
corresponding to this gene are useful in treatment/detection of disorders in the nervous
system, including schizophrenia, neurodegeneration, neoplasia, brain cancer as well as
cardiovascular and female reproductive disorders including cancer within the above
tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 156 

The translation product of this gene shares sequence homology with the human
gene encoding cytochrome b561 (See Accession No. P10897). Cytochrome b561 is a
transmembrane electron transport protein that is specific to a subset of secretory vesicles
containing catecholamines and amidated peptides. This protein is thought to supply
reducing equivalents to the intravesicular enzymes dopamine-beta-hydroxylase and
alpha-peptide amidase. Preferred polypeptides of the invention comprise the amino acid
sequence:

MAMEGYWRFLALLGSALLVGFLS
VLMVTGFVnQGlAIIVYRLPWTWKCSKLLMKS
NHNVNNLANMYSLHSWVGLM
YSGIVIFGTV1ATALMGLTEKLIFSLRDPAYSTFPPEGVFVNTLGLLILVFGALIF
WIVTRPQWKRPKEPNSTILHPNGGTEQGARGSMPAYSGNNMDKSDSEL
NSEVAARKRNLALDEAGQRSTM (SEQ ID NO:724); as well as antigenic fragments
of at least 20 amino acids of this gene and/or biologically active fragments. Also
preferred are polynucleotide fragments encoding these polypeptide fragments.

This gene is expressed primarily in anergic T-cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune system and metabolism related diseases. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the immune system, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that the protein product or RNA of this gene is
useful for treatment or diagnosis of immune system and metabolic diseases or
conditions including Tay-Sachs disease, phenylketonuria, galactosemia, various
porphyrias, and Hurler's syndrome.

FEATURES OF PROTEIN ENCODED BY GENE NO: 157 

The translation product of this gene shares sequence homology with collagen,
which is important in mammalian development. This gene also shows sequence
homology with bcl-2. (See Accession No. P80988.) Preferred polypeptide fragments
comprise the amino acid sequence: PGRAGPSPGLSLQLPAEPGHPA GNLA PL
TSRPQPLCRIPAVPG (SEQ ID NO:725). Also preferred are polynucleotide
sequences encoding this polypeptide fragment.

This gene is expressed primarily in HL-60 tissue culture cells and to a lesser 
extent in liver, breast, and uterus. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immunological diseases, hereditary disorders involving the MHC class
of immune molecules, as well as developmental disorders and reproductive disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune and reproductive systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 390 as residues: Ser-39 to Gly-46,
Leu-49 to Ala-62.

The tissue distribution and homology to collagen indicate that polynucleotides
and polypeptides corresponding to this gene are useful for diagnosis and treatment of
hereditary MHC disorders and particularly autoimmune disorders including rheumatoid
arthritis, lupus, scleroderma, and dermatomyositis, as well as many reproductive
disorders, including cancer of the uterus and breast tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 158 

This gene is expressed primarily in the amygdala region of the brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, a variety of brain disorders, particularly those affecting mood and
personality. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the brain and central nervous system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment and/or diagnosis of a variety of brain
disorders, particularly bipolar disorder, unipolar depression, and dementia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 159 

This gene is expressed in a variety of tissues and cell types including brain,
smooth muscle, kidney, salivary gland and T-cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancers of a variety of organs including brain, smooth muscle, kidney,
salivary gland and T-cells and cardiovascular disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the central nervous, urinary, salivary,
digestive, and immune systems, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution in brain, smooth muscle, and T-cells indicates that
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis of
various neurological and cardiovascular disorders, including, but not limited to, cancer
within the above tissues. Additionally, the gene product may be used as a target in the
immunotherapy of the cancer. Because the gene is expressed in cells of lymphoid
origin, the natural gene product may be involved in immune functions. Therefore, it may
also be used as an agent for immunological disorders including arthritis, asthma,
immune deficiency diseases such as AIDS, and leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 160 

The translation product of this gene shares sequence homology with collagen,
which is thought to be important in cellular interactions and extracellular matrix formation,
and has been found to be an identifying determinant in autoimmune disorders.
Moreover, this gene shows sequence homology with the yeast protein Sls1p, an
endoplasmic reticulum component involved in the protein translocation process in
the yeast Yarrowia lipolytica. (See Accession No. 1052828; see also J. Biol. Chem. 271,
11668-11675 (1996).) With mouse, this same region shows sequence homology with
the heavy chain of kinesin. (See Accession No. 2062607.) Recently, suppression of the
heavy chain of kinesin was shown to inhibit insulin secretion from primary cultures of
mouse beta-cells. (See Endocrinology 138 (5), 1979-1987 (1997).) Moreover, kinesin
was found associated with drug resistance and cell immortalization. (See 468355.)
Thus, it is likely that this gene also acts as a genetic suppressor element.

This gene is expressed primarily in the greater omentum and to a lesser extent in
a variety of organs and cell types including gall bladder, stromal bone marrow cells,
lymph node, liver, testes, pituitary, and thymus.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, disorders of the endocrine, gastrointestinal, and immunological systems,
including autoimmune disorders and cancers in a variety of organs and cell types.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune and gastrointestinal systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 393 as residues: Asn-27 to Leu-47,
Gln-81 to Lys-88, Asp-93 to Lys-102, Asn-107 to Leu-116, Met-129 to Glu-141, Glu-150
to Asp-157, Lys-176 to Glu-185, Glu-333 to Tyr-349, Cys-393 to Leu-403, Gln-423
to Gly-429.
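
Residue ranges written in the form used above (for example, "Asn-27 to Leu-47") can be
read mechanically against a sequence. The following is a minimal sketch under stated
assumptions, not a procedure taken from this specification: the function names
parse_epitope_ranges and extract_epitope are hypothetical, positions are assumed to be
1-based and inclusive, and any sequence passed in is supplied by the user.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Matches ranges such as "Asn-27 to Leu-47".
RANGE_RE = re.compile(r"([A-Z][a-z]{2})-(\d+)\s+to\s+([A-Z][a-z]{2})-(\d+)")


def parse_epitope_ranges(text):
    """Return (start_residue, start_pos, end_residue, end_pos) tuples."""
    return [(a, int(i), b, int(j)) for a, i, b, j in RANGE_RE.findall(text)]


def extract_epitope(sequence, start_res, start, end_res, end):
    """Slice a 1-based inclusive range and check that both boundary
    residues agree with the one-letter sequence."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("range lies outside the sequence")
    peptide = sequence[start - 1:end]
    if (peptide[0] != THREE_TO_ONE[start_res]
            or peptide[-1] != THREE_TO_ONE[end_res]):
        raise ValueError("boundary residues do not match the sequence")
    return peptide
```

The boundary check is a design choice for this sketch: a mismatch between the named
residue and the residue at that position usually signals an off-by-one error or a
transcription error in the sequence, so the function raises instead of returning a peptide.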

The tissue distribution within various endocrine and immunological tissues
combined with the sequence homology to a conserved collagen motif indicates that
polynucleotides and polypeptides corresponding to this gene are useful for the
diagnosis of various autoimmune disorders including, but not limited to, rheumatoid
arthritis, lupus erythematosus, scleroderma, and dermatomyositis. Because the gene is
expressed in cells of lymphoid origin, the natural gene product may be involved in
immune functions. Therefore, it may also be used as an agent for immunological
disorders including arthritis, asthma, immune deficiency diseases such as AIDS, and
leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 161 

This gene has homology to the tissue inhibitor of metalloproteinase 2. Such
inhibitors are vital to proper regulation of metalloproteinases such as collagenases (See
Accession No. P16368). In addition, this gene maps to chromosome 17, and
therefore, may be used as a marker in linkage analysis for chromosome 17 (See
Accession No. P16368).

This gene is expressed primarily in several types of cancer including
osteoclastoma, chondrosarcoma, and rhabdomyosarcoma and to a lesser extent in
several non-malignant tissues including synovium, amygdala, testes, and placenta.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, various types of cancer, particularly cancers of bone and cartilage, as
well as various autoimmune disorders. Similarly, polypeptides and antibodies directed
to these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the musculoskeletal system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution in various cancers and the sequence homology to a
collagenase inhibitor indicate that polynucleotides and polypeptides corresponding to
this gene are useful for detection of various autoimmune disorders such as rheumatoid
arthritis, lupus, scleroderma, and dermatomyositis. Therefore, it may also be used as an
agent for immunological disorders including arthritis, asthma, immune deficiency
diseases such as AIDS, and leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 162 

This gene is homologous to the mitochondrial ATP6 gene and therefore is likely
a member of this gene family (See Accession No. X76197).

This gene is expressed primarily in brain tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, a variety of brain disorders, including Down's syndrome, depression,
schizophrenia, and epilepsy. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the central nervous system, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution in brain tissue indicates that this gene is useful for diagnosis
of various neurological disorders including, but not limited to, brain cancer.
Additionally, the gene product may be used as a target in the immunotherapy of cancer in
the brain as well as for the diagnosis of metabolic disorders such as obesity, Tay-Sachs
disease, phenylketonuria and Hurler's syndrome.

FEATURES OF PROTEIN ENCODED BY GENE NO: 163

This gene is expressed primarily in placenta, neutrophils, and microvascular
endothelial cells and to a lesser extent in multiple tissues including brain, prostate,
spleen, thymus, and bone.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neutropenia and other diseases of the immune system. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution in placenta indicates that polynucleotides and
polypeptides corresponding to this gene are useful for diagnosis of various female
reproductive disorders. Additionally, the gene product may be used as a target in the
immunotherapy of various cancers. Because the gene is expressed in some cells of
lymphoid and endocrine origin, the natural gene product may be involved in immune
functions and metabolism regulation, respectively. Therefore, it may also be used as an
agent for immunological disorders including arthritis, asthma, immune deficiency
diseases such as AIDS, and leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 164 

This gene is expressed primarily in neutrophils, monocytes, bone marrow, and
fetal liver.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune system disorders including, but not limited to, autoimmune
disorders such as lupus, and immunodeficiency disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system, expression of this gene
at significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution in various immune system tissues indicates that
polynucleotides and polypeptides corresponding to this gene are useful for the
diagnosis of various immunological disorders such as Hodgkin's lymphoma, arthritis,
asthma, immune deficiency diseases such as AIDS, and leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 165

The translation product of this gene shares sequence homology with dystrophin,
which is thought to be defective in both Duchenne and Becker Muscular Dystrophy.
Preferred polypeptide fragments comprise the following amino acid sequence:

MKLLGECSSSIDSVKRLEHKLKEEEESLPGFVNXHSTETQTAGVIDRWELLQAQ 
ALSKELRMKQ^QKWQQFNSPLNSW 

IKKLKELQK.AVDHRKAIlLSINLCSPEFrQADSKESRDLQDRLXQMNGRWDRV 
CSLLEEWRGLLQDALMQCQGFHEMSHGL^ 

QDHHKQLMQIKHELLESQLRVASLQDMSCQLLVN.AEGTDCLEAKEKVHV1GNR
LKLLLKEVSRHIKELEKLLDVSSSQQDLSSWSSADELDTSGSVSPXSGRSTPNR 
QKTPRGKCSLSQPGPSVSSPHSRSTKGGSDSSLSEPXPGRSGRGFLFRVLRAA 
LPLQLLLLLLIGLACLVPMSEEDYSCALSNNFARSFHPMLRYTNGPPPL (SEQ ID 
NO:726); MKLLGECSSSIDSVKRLEHKLKEEEESLPGFVNLHSTETQTAGVIDR 

WELLQAQALSKELRN1KQNLQKWQQFNSDLNSIWAWLGDTEE

TDIQTIELQIK (SEQ ID NO:727); KLKELQICAVDHRKAHLSINLCSPEFTQADSK 
ESRDLQDRLXQMNGRWDRVCSLUEEW 

ENIDRRKNEI VPIDSNLD AEILQD HHKQLM QIKHELLES QLR V AS L QDMS C QL 
(SEQ ID NO:728); QDMSCQLLVNAEGTDCLEAKEKVHVIGNRUa-LLKEVS 

RHIKELEKLLDVSSSQQDLSSWSSADELDTSGSVSPXSGRSTPNRQKTPRGKCS
LSQPGPSVSSPHS (SEQ ID NO:729); DSSLSEPXPGRSGRGFLFRVLRAAL 
PLQLLLLLLIGLACLVPMSEEDYSCALSNNFARSFHPMLR\TNGPPPL (SEQ ID 
NO:730). Also preferred are polynucleotide fragments encoding these polypeptide
fragments. Furthermore, this gene maps to chromosome 6 and, therefore, may be used
as a marker in linkage analysis for chromosome 6 (See Accession No. N62896).

This gene is expressed in numerous tissues including the heart, kidney, and
brain.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, musculoskeletal disorders including Muscular Dystrophy and
cardiovascular diseases. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the muscle tissues, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to dystrophin indicate that
polynucleotides and polypeptides corresponding to this gene are useful for the
diagnosis and treatment of Muscular Dystrophy and other muscle disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 166 

This gene is expressed primarily in human cerebellum. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases of the central nervous system, including Alzheimer's Disease,
Parkinson's Disease, ALS, and mental illnesses. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the central nervous system, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 399 as residues: Pro-20 to Gly-26, Leu-37 to Pro-42, His-57 to Gly-63,

The tissue distribution indicates that the protein products of this gene are useful
for treatment/diagnosis of diseases of the central nervous system and may protect or
enhance survival of neuronal cells by slowing progression of neurodegenerative
diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 167

Preferred polypeptides encoded by this gene comprise the following amino acid
sequence:

MKLLICGN^XAPSHSESSRRCCLLCFYPLCLELNFGMKVFLSMPFLVLFQ 
SLIQED (SEQ ID NO:731). Polynucleotides encoding such polypeptides are also
provided. This gene is believed to reside on chromosome 15. Therefore, polynucleotides
derived from this gene are useful in linkage analysis as chromosome 15 markers.

This gene is expressed primarily in human testes tumor and to a lesser extent in
normal human testes.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases of the testes, particularly cancer, and other reproductive
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the male reproductive tissues, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution indicates that the protein products of this gene are useful 
for treatment/diagnosis of testicular diseases including cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 168 

This gene is expressed primarily in fetal liver.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, conditions affecting hematopoietic development and metabolic diseases.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
hepatic system and fetal hematopoietic system, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder. Preferred epitopes include
those comprising a sequence shown in SEQ ID NO: 401 as residues: His-7 to Trp-17,
Leu-19 to Lys-27, Pro-33 to Gly-44, Lys-68 to Gly-74, Lys-85 to Cys-95.
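
By way of illustration only, residue ranges of the kind recited above (e.g., "His-7 to Trp-17") can be mapped onto a polypeptide sequence programmatically. The following Python sketch is not part of the disclosed sequences or methods; the function names `parse_epitope_ranges` and `extract_epitopes` and the toy sequence are hypothetical, and 1-based residue numbering is assumed. The sketch parses the range strings and checks that the named boundary residues agree with the sequence before extracting each fragment.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Matches ranges such as "His-7 to Trp-17"; tolerates a stray space after the hyphen.
RANGE_RE = re.compile(r"([A-Z][a-z]{2})-\s*(\d+)\s+to\s+([A-Z][a-z]{2})-\s*(\d+)")


def parse_epitope_ranges(text):
    """Return (start_residue, start_pos, end_residue, end_pos) tuples, 1-based."""
    return [(a, int(i), b, int(j)) for a, i, b, j in RANGE_RE.findall(text)]


def extract_epitopes(sequence, text):
    """Extract the subsequence for each range, validating the boundary residues."""
    fragments = []
    for start_res, start, end_res, end in parse_epitope_ranges(text):
        if not (1 <= start <= end <= len(sequence)):
            raise ValueError(f"range {start}-{end} lies outside the sequence")
        if sequence[start - 1] != THREE_TO_ONE[start_res]:
            raise ValueError(f"position {start} is not {start_res}")
        if sequence[end - 1] != THREE_TO_ONE[end_res]:
            raise ValueError(f"position {end} is not {end_res}")
        fragments.append(sequence[start - 1:end])
    return fragments


# Hypothetical toy sequence (not one of the SEQ ID NOs of this document).
toy_sequence = "MAHKLWPGG"
print(extract_epitopes(toy_sequence, "His-3 to Trp-6, Pro- 7 to Gly-9"))
```

Validating the boundary residues against the sequence guards against numbering that has been shifted, for example by counting from a signal peptide rather than from the initiating methionine.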

The tissue distribution indicates that the protein products of this gene are useful
for treatment/diagnosis of diseases of the developing liver and hematopoietic system,
and act as a growth differentiation factor for hematopoietic stem cells.

FEATURES OF PROTEIN ENCODED BY GENE NO: 169 

The polypeptide encoded by this gene is believed to be a membrane bound
receptor, the extracellular domain of which is expected to consist of the following
amino acid sequence:

RILLVKYSANEENKYDtLFrTVNVCSELVKLVFCVLVSFCVIKKDHQSRNLKY 
ASWKEFSDFMKWSIPAF^ 

LKXRLNWIQWASLLTLFLSIVALTAGTKTLQHNLAGRGFHHDAFFSPSNSCLL 
FRNECPRKDNCTAKEWTFPEAKWNTTARVFSHIRLGMGHVL

YNEKILKEGNQLTEXIFIQNSKLYFFGILFNGLTLGLQRSNRDQIKNCGFFYGH 
S (SEQ ID NO:732). Thus, preferred polypeptides encoded by this gene comprise the
extracellular domain as shown above. It will be recognized, however, that deletions of
either end of the extracellular domain up to the first cysteine from the N-terminus and
the first cysteine of the C-terminus are expected to retain the biological functions of the
full-length extracellular domain because the cysteines are thought to be responsible for
providing secondary structure to the molecule. Thus, deletions of one or more amino
acids from either end (or both ends) of the extracellular domain are contemplated. Of
course, further deletions including the cysteines are also contemplated as useful, as such
polypeptides are expected to have immunological properties such as the ability to evoke
an immune response. Polynucleotides encoding all of the foregoing polypeptides are
provided.
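
By way of illustration only, the terminal deletions described above (removal of residues from either end up to, but not including, the first cysteine from the N-terminus and the first cysteine from the C-terminus) can be enumerated as in the following Python sketch. The function names `cysteine_bounds` and `terminal_deletion_variants` and the toy sequence are hypothetical and are not taken from this document; the sketch assumes a one-letter amino acid string in which cysteine is written "C".

```python
def cysteine_bounds(sequence):
    """Return 0-based indices of the first and last cysteine in the sequence."""
    first = sequence.find("C")
    last = sequence.rfind("C")
    if first == -1:
        raise ValueError("sequence contains no cysteine")
    return first, last


def terminal_deletion_variants(sequence):
    """List every variant obtained by deleting residues from either or both
    ends while retaining the outermost cysteines."""
    first, last = cysteine_bounds(sequence)
    max_c_deletions = len(sequence) - 1 - last
    variants = []
    for n_deleted in range(first + 1):            # 0 .. residues before first Cys
        for c_deleted in range(max_c_deletions + 1):  # 0 .. residues after last Cys
            variants.append(sequence[n_deleted:len(sequence) - c_deleted])
    return variants


# Hypothetical toy sequence (not one of the SEQ ID NOs of this document).
toy_domain = "AACDEFCGG"
for variant in terminal_deletion_variants(toy_domain):
    print(variant)
```

The first variant returned is the undeleted domain, and the last is the shortest cysteine-bounded core; further deletions that remove the cysteines themselves fall outside this sketch.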

This gene is expressed primarily in human osteoclastoma and to a lesser extent
in hippocampus and chondrosarcoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancers, particularly those of the bone and connective tissues. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the skeletal system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 402 as residues: Met-1 to Cys-6, Ala-41 to Tyr-49, Lys-76 to Lys-84.

The tissue distribution indicates that the protein products of this gene are useful 
for diagnosis of cancers of the bone and connective tissues, and may act as growth 
factors for cells involved in bone or connective tissue growth. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 170 

Preferred polypeptides encoded by this gene comprise the following amino
acid sequence:

NSVPNLQTLAYLTEAIGPEPAIPRXPREPPVATSTPATPSAGPQPLPTGTV 

LVPGGPAPPCLGEAWALLLPPCRPSLTSCFWSPRPSPWKETGV (SEQ ID
NO:733). Polynucleotides encoding such polypeptides are also provided herein.

This gene is expressed primarily in hematopoietic progenitor cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases of the blood including cancer and autoimmune disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
blood/circulatory system, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 403 as residues: Gln-4 to His-10, Pro-25
to His-32.

The tissue distribution indicates that the protein products of this gene are useful
for diagnosis of diseases involving growth differentiation of hematopoietic cells.

FEATURES OF PROTEIN ENCODED BY GENE NO: 171

Preferred polypeptides encoded by this gene comprise the following amino acid
sequences: ALQLAFYPDAVEEWLEENVHPSLQRLQXLLQDLSEVS APP (SEQ ID
NO:734); and/or CHPPALAGTLLRTPEGRAHARGLLLEAGGA (SEQ ID NO:735).
Polynucleotides encoding such polypeptides are also provided. The protein product of
this gene shares sequence homology with metallothioneins. Thus, polypeptides encoded
by this gene are expected to have metallothionein activity; such activities are known in
the art and described elsewhere herein.

This gene is expressed primarily in kidney cortex. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases of the kidney including cancer and renal dysfunction. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the renal system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 404 as residues: Ser-47 to Gln-52.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of diseases of the kidney
including kidney failure.

FEATURES OF PROTEIN ENCODED BY GENE NO: 172 

This gene is expressed primarily in 12 week old early stage human.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, developmental abnormalities. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the developing embryo, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 405 as residues: Gln-31 to Thr-43, Gly-51 to Ser-58, Pro-65 to Pro-72.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treatment/diagnosis of developmental
problems with fetal tissue. The gene may be involved in vital organ development in the
early stage, especially hematopoiesis, cardiovascular system, and neural development.

FEATURES OF PROTEIN ENCODED BY GENE NO: 173

The translation product of this gene shares sequence homology with TGN38, an
integral membrane protein previously shown to be predominantly localized to the
trans-Golgi network (TGN) of cells.

This gene is expressed primarily in developing embryo and to a lesser extent in
cancer tissues including lymphoma, endometrial, prostate and colon.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, developmental abnormalities and cancer. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the developing fetus, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 406 as residues: His-65 to Ser-72, Pro-82 to Gly-91, Pro-98 to Glu-118, Ser-126
to Gly-166, Pro-180 to Asp-188, Tyr-209 to Lys-214, Gln-220 to Leu-228.

The tissue distribution and homology to an integral membrane protein indicate
that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis of cancers and developmental abnormalities where aberrant expression relates
to an abnormality.

FEATURES OF PROTEIN ENCODED BY GENE NO: 174 

The translation product of this gene shares sequence homology with a dnaJ heat
shock protein from E. coli which is allelic to sec63, a gene that affects transit of nascent
secretory proteins across the endoplasmic reticulum in yeast.

This gene is expressed primarily in Hodgkin's lymphoma and to a lesser extent
in testes.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancer. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 407 as residues: Thr-13 to Trp-21,
Arg-74 to Asp-81.

The tissue distribution and homology to dnaJ indicate that polynucleotides and
polypeptides corresponding to this gene are useful as a diagnostic for cancer including
Hodgkin's lymphoma.

FEATURES OF PROTEIN ENCODED BY GENE NO: 175 

This gene is expressed primarily in endothelial cells and to a lesser extent in
bone marrow stromal cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, diseases involving angiogenic abnormalities including diabetic
retinopathy, macular degeneration, and other diseases including arteriosclerosis and
cancer. Similarly, polypeptides and antibodies directed to these polypeptides are useful
in providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
vascular system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that the protein products of this gene are useful 
for treating diseases where an increase or decrease in angiogenesis is indicated and as a 
factor in the wound healing process.

FEATURES OF PROTEIN ENCODED BY GENE NO: 176 

The translation product of this gene shares sequence homology with MAT-8
(mouse), which is thought to be important in regulating chloride conductance in cells
(particularly in the breast) by modulating the response mediated by cAMP and protein
kinase C to extracellular signals.

This gene is expressed primarily in amniotic cells and hematopoietic cells
including macrophages, neutrophils, T cells, TNF induced aortic endothelium and to a
lesser extent in testes, TNF induced epithelial cells, and smooth muscle.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are 
not limited to, inflammatory responses mediated by T cells, macrophages, and/or 
neutrophils particularly those involving TNF, and also cancer. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 409 as residues: Thr-19 to Ala-33, Leu-54 to Asp-82, Pro-89 to Ala-97, Pro-100
to Lys-125, Ser-127 to Phe-135, Gly-164 to Leu-169, Cys-173 to Arg-178.

The tissue distribution and homology to MAT-8 indicate that polynucleotides and
polypeptides corresponding to this gene are useful for modifying inflammatory
inflammation. Polynucleotides and polypeptides derived from this gene are thought to
be useful in the diagnosis and treatment of cancer.

FEATURES OF PROTEIN ENCODED BY GENE NO: 177

This gene is expressed primarily in endothelial cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, vascular restenosis. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the vascular system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for treating diseases associated with vascular
response to injury such as vascular restenosis following angioplasty.

FEATURES OF PROTEIN ENCODED BY GENE NO: 178 

One embodiment of the claimed invention comprises: 
MRPDWKAGAGPGGPPQKPAPSSQRKPPARPSAAAAA

RLRLEEDKPAVERCL^ELWGDVENDEDALLRRLRGPRVQEHED^GDSEVENEA 
KGNFPPQKXPVWVDEEDEDEEMVDN 

RLKEEFQHANIGGVPAWAETTKRKTSSDDESEEDEDDLLQRTGNHSTSTSLPRG 
ILKiMKNCQHANAERPTVARISICAVPSRCTDCDGCWD (SEQ ID NO:737); or 
CLEELVFGDVENDEDALLRRLRGPRVQEHEDSGDSEVENEAKG^^PQKKPy
WVDEEDEDEEMVDMMNNRFRK^ 

GVPAWAETTKRKTSSDDESEEDEDDLLQRTGNHSTSTSLPRGILK^IKNCQHA 
NAERPTVARISICAVPSRCTDCDGC (SEQ ID NO:738). LKEKIVRSFEVSPDGS
FLLINGL\GYLHLL.\MKTKELIGSMKINGRVAASTFSSDSKKVYASSGDGEV^ 
WDVNSRKCLNRF\ r DEGSLYGLSL\TSRNGQYVACGSNCGVVNIYNQDSCLQE
TNPKPIKAIMNLVTGVTSLT 

KNKNISHVHTMDFSPRSG\TALGNEKGKL\LMYRLHHYSDF (SEQ ID NO:739);
and/or KINGRVAASTFSSDSKKVYASSGDGEVYVWDVNSRKCUNRFYDEGSL

YGLSLATSRNGQYVACGSNCGVVNIYNQDSCLQETNPKPIK.^MNLVTGVTSLT 
FNPTTEILAIASEKJ/1KEAVRLVHLPSCTVFSNTPV 

YFALGNEKGKAL (SEQ ID NO:740). 

This gene is expressed primarily in epididymis and endometrial tumors and to a
lesser extent in T cell lymphoma and cell lines derived from colon cancer.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, tumors of the reproductive organs including testis and endometrial cells.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
reproductive system, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 411 as residues: Ser-67 to Lys-72, Val-87 to Leu-93,
Tyr-128 to Pro-141, Asp-204 to Gly-210.

The tissue distribution indicates that the protein products of this gene are useful 
for treating tumors of the endometrium or epithelial tumors of the reproductive system. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 179

Preferred polypeptides encoded by this gene comprise the following amino acid 
sequence: 

MRILQLILLALATGLVGGETRIIKGFECKLHSQPWQAALFEKTRLLCGATLIAPR 

WLLTAAHCLKPRYIVHLGQHNLQKEEGCEQTRTATESFPHPGFNNSLPNKDH 
RNDIMLVKMASPVSITWAV^

DAPTSPSLSTRSVRTPTPATSQTPWCVPACRKG.ARTPARVTPGALWSVTSLFK.A 
LSPGARIRVRSPESLVSTRKSANMWTGSRRR (SEQ ID NO:741); ETRIIKGFEC
KLHSQPWQAALFEKTRIXCGATLI^ 
GCEQTRTATESFPHPGFNNSLPNKDHRiNDLMLVm 

CVTAGTSCSFPAGAARPDPSYACLTPCDAPTSPSLSTRSVRTPTPATSQTPWCVP
ACRKGARTPARVTPGALWSVTSLFKALSP
SRRR (SEQ ID NO:742); or CKLHSQPWQAALFEKTRLLCGATLIAPRWLLT
AAHCLKPRYIVHLGQHNLQKEEGCE.QTRTATESFPHPGFNS 
(SEQ ID NO:743). The translation product of this gene shares sequence homology
with neuropsin, a novel serine protease which is thought to be important in modulating
extracellular signaling pathways in the brain. Owing to the structural similarity to other
serine proteases, the protein products of this gene are expected to have serine protease
activity, which may be assayed by methods known in the art and described elsewhere
herein.

This gene is expressed primarily in endometrial tumor and to a lesser extent in
colon cancer, benign hypertrophic prostate, and thymus.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancers of the endometrium or colon and benign hypertrophy of the
prostate. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the urogenital or reproductive systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 412 as residues: Gly-12 to Ser-22,
Pro-34 to Ser-53.

The tissue distribution and homology to serine proteases indicate that
polynucleotides and polypeptides corresponding to this gene are useful for diagnosing
or treating hyperproliferative disorders such as cancer of the endometrium or colon and
hyperplasia of the prostate.

FEATURES OF PROTEIN ENCODED BY GENE NO: 180 

Preferred polypeptides encoded by this gene comprise the following amino acid
sequence: VLQGRYFSPILEMRR^ 

PHFAMTRSYVPTKQCMVQGSFYCIFIFKGPVQNWC (SEQ ID NO:744). 
35 Polynucleotides, encoding such polypeptide are also provided. 
'This gene is expressed primarily in fetal brain 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the ussue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions, which include, but are - 
not limited to, identifying and expanding stem cells in the CNS. Similarly, polypeptides 
5 and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the nervous system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
10 urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an* 

individual having such a disorder, relative to the standard gene expression level, i.e., , 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 
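The comparison described above, a sample's expression level measured against the standard level from healthy tissue or bodily fluid, can be illustrated with a short sketch. The text does not define what counts as "significantly" higher or lower, so the z-score cutoff below is purely an assumption of this example, as are the function name `compare_to_standard` and its parameters.

```python
from statistics import mean, stdev


def compare_to_standard(sample_level, healthy_levels, z_cutoff=2.0):
    """Classify a sample's expression level against healthy reference levels.

    Returns (fold_change, call), where call is 'higher', 'lower' or 'unchanged'.
    The z-score cutoff is an illustrative assumption, not a value from the text.
    """
    if len(healthy_levels) < 2:
        raise ValueError("need at least two healthy reference measurements")
    standard = mean(healthy_levels)   # the "standard gene expression level"
    if standard <= 0:
        raise ValueError("standard expression level must be positive")
    spread = stdev(healthy_levels)    # variability among healthy samples
    fold_change = sample_level / standard
    diff = sample_level - standard
    if spread == 0:
        significant = diff != 0
    else:
        significant = abs(diff) / spread >= z_cutoff
    if not significant:
        return fold_change, "unchanged"
    return fold_change, "higher" if diff > 0 else "lower"
```

A call such as `compare_to_standard(22.0, [10.0, 12.0, 11.0, 9.0, 13.0])` would report a two-fold increase classified as "higher" under these assumed settings. In practice the choice of statistic and cutoff would depend on the assay used.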

The tissue distribution indicates that the protein products of this gene are useful
for detecting and expanding stem cell populations in the (or of the) central nervous
system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 181 

This gene is expressed primarily in early stage human brain and a stromal cell
line.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, developmental abnormalities of the CNS. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the central nervous system, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 414 as residues: Gln-42 to Gln-47, Gln-54 to Pro-60.

The tissue distribution indicates that the protein products of this gene play a role
in the development of the central nervous system. Therefore, this gene and its products
are useful for diagnosing or treating developmental abnormalities of the central nervous
system.

FEATURES OF PROTEIN ENCODED BY GENE NO: 182 
Preferred polypeptides encoded by this gene comprise the following amino acid
sequence:

MPIIDQVNPELHDFMQSAEYGTIF^SWLITWPGHVLSDFRHVVRLYDF 
FLACHPLMPIYFAAVIVLYREQE 

TFLFSFPHPNLLGRPLPNSKLRGRQPLLSKTLSWHQPSRGLIWCCGSGXRGLL 

RPEDRTKD VLTKPRTNREVKLA VMGLTV ALG AAALA VVKS ALEW APKFQLQL.
FP (SEQ ID NO:745); or CPEFFIP ATLPCPFVF AFTS E ASS RA YLTQRGPGGL AQ 
NLMPLPVGFWMGSLPPPWCWRKWVSEACSCFC (SEQ ID NO:746). These
polypeptides are structurally similar to various TGF-beta family members. Thus, these
polypeptides are expected to have a variety of activities in the modulation of cell growth
and proliferation.

This gene is expressed primarily in osteoclastoma, microvascular endothelium,
and bone marrow derived cell lines.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, hematological diseases, particularly those involving aberrant proliferation
of stem cells. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. Preferred epitopes include those comprising a
sequence shown in SEQ ID NO: 415 as residues: Ser-33 to Ala-39.

The tissue distribution indicates that the protein products of this gene are useful
for treating disorders of the progenitors of the immune system. Applications include in
vivo expansion of progenitor cells, ex vivo expansion of progenitor cells, or the
treatment of tumors of the circulatory system, such as lymphomas.

FEATURES OF PROTEIN ENCODED BY GENE NO: 183

This gene maps to chromosome 17 and therefore, polynucleotides of the 
invention can be used in linkage analysis as a marker for chromosome 17. In specific 
embodiments, polypeptides of the invention comprise the sequence: 
GFGS VS AAGRRSGGTWQP VQ (SEQ ID NO:747); PGGLAVGSRWWSRSLT
(SEQ ID NO:748); LEPSRQRRPRRRGGTSRPETDQRAKCWRQL (SEQ ID 
NO:749); and/or VCLRCQNRMEN (SEQ ID NO:750). In further specific 
embodiments, polypeptides of the invention comprise the sequence: MAACTARRPGR 
GQPLWPVADXGPVAKAALCAAXAGA 
GVFEVPKQNGKYETGQLFLHSIFGYRGVVLFPWQARLXDRDVASAAPEK^EN
P AGHGS KE VKGKTHT Y Y Q VLID ARDCPHIS QRS QTE A VTFL ANHD DS RAL Y ATP 
GLDYVSHEDILPYTSTDQVPIQHELFERFLLYDQT 
PWLELSDVHRETTENIRVTWFYMGMREAQNS 

LRERHWRIFSLSGTLETVRGRGWGREPVLSKEQP.AFQYSSHVSLQASSGHMW 

GTFRFERPDGSHFDVRIPPFSLESNICDEKTPPSGLHW (SEQ ID NO:751);
MAACTARRPGRGQPLVWVADXGPVAKAALCAA (SEQ ID NO:752); 
VLETVGVFEVPKQNGKYETGQLFLHSIFGYRGVVL (SEQ ID NO:757); 
GLDYVSHEDELPYTST (SEQ ID NO:758); DVHRETTENIRVTVIPFYM (SEQ ID 
NO:759); WWRYCIRLENLDSDVVQLRER (SEQ ID NO:760); and/or PAFQYSS 

HVSLQASSGHMWGTFRFER (SEQ ID NO:761). Polynucleotides encoding these
polypeptides are also encompassed by the invention. 

This gene is expressed primarily in gall bladder, prostate, and fetal brain, and to 
a lesser extent in a few tumor and fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, growth related disorders such as cancers. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the prostate, gall bladder, and fetal brain,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of growth-related
disorders such as cancers.

FEATURES OF PROTEIN ENCODED BY GENE NO: 184

In specific embodiments, polypeptides of the invention comprise the
sequence: SLCCPEGAEGC (SEQ ID NO:762) and/or QLKKTHYDRPCP (SEQ ID
NO:763). Polynucleotides encoding these polypeptides are also encompassed by the
invention.
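Whether a candidate polypeptide "comprises" one of the recited fragments amounts to a substring search over its amino acid sequence. The sketch below illustrates this with the two fragments recited for this gene. The helper name `fragments_present` and the flanking residues in the usage note are hypothetical and are not taken from the sequence listing.

```python
# Fragments recited for Gene No: 184, in one-letter amino acid code.
GENE_184_FRAGMENTS = {
    "SEQ ID NO:762": "SLCCPEGAEGC",
    "SEQ ID NO:763": "QLKKTHYDRPCP",
}


def fragments_present(polypeptide, fragments):
    """Map each fragment name to the 1-based start positions where it occurs.

    Whitespace in the input is ignored, since sequence text is often
    broken into groups or wrapped across lines.
    """
    seq = "".join(polypeptide.split()).upper()
    hits = {}
    for name, fragment in fragments.items():
        positions = []
        start = seq.find(fragment)
        while start != -1:
            positions.append(start + 1)            # report 1-based positions
            start = seq.find(fragment, start + 1)  # allow overlapping matches
        hits[name] = positions
    return hits
```

For example, `fragments_present("MKT SLCCPEGAEGC AAA", GENE_184_FRAGMENTS)` reports SEQ ID NO:762 at position 4 and no occurrence of SEQ ID NO:763. This exact-match search does not account for variants or conservative substitutions, which would require an alignment tool.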

This gene is expressed primarily in stromal cells, tonsil, and glioblastoma and to
a lesser extent in some other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune and inflammatory disorders and glioblastoma. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the stromal cells,
tonsil, and glioblastoma, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Additionally, it is believed that the
product of this gene regulates pancreatic cell differentiation into beta cells. Accordingly,
polynucleotides and polypeptides of the invention are useful in the treatment of
insulin-dependent diabetes mellitus and associated conditions, e.g., pancreatic
hypofunction, and in the prevention, as well as the treatment, of undifferentiated type
pancreatic cancers. Preferred epitopes include those comprising a sequence shown in
SEQ ID NO: 417 as residues: Pro-27 to Ala-32.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of immune and
inflammatory disorders and glioblastoma.

FEATURES OF PROTEIN ENCODED BY GENE NO: 185

This gene is expressed primarily in hepatocellular carcinoma and to a lesser
extent in other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, liver diseases. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the liver, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues (e.g., cancerous and wounded tissues) or
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 418 as residues: Gly-32 to Lys-39.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of liver diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 186 

This gene is expressed primarily in hippocampus and to a lesser extent in other
tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neuronal disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the hippocampus, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of neuronal disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 187

This gene is expressed primarily in bone cancer and hippocampus and to a
lesser extent in osteoclastoma and other tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, bone-related disorders and neuronal diseases. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the bone, osteoclast, and
hippocampus, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues (e.g., cancerous and wounded tissues) or bodily
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of bone-related
disorders and neuronal diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 188 

This gene maps to chromosome 4 and therefore polynucleotides of the invention
can be used in linkage analysis as a marker for chromosome 4.

This gene is expressed primarily in neuronal tissues such as hippocampus,
spinal cord, and hypothalamus and to a lesser extent in a few other tissues such as
ovary.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neuronal diseases. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the neuronal tissues, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of neuronal disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 189

This gene maps to chromosome 10 and therefore polynucleotides of the invention
can be used in linkage analysis as a marker for chromosome 10.

This gene is expressed primarily in neuronal tissues and immune tissues, and to
a lesser extent in a few other tissues such as skin tumor, lung, etc.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neuronal and immune-related disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the neuronal and immune-related tissues,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 422 as residues: Pro-19 to Asp-25.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of neuronal and 
immune-related disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 190

The translation product of this gene shares sequence homology with human 
N33, a gene located in a homozygously deleted region of human metastatic prostate 
cancer which is thought to be important in prevention of prostate cancer. In specific 
embodiments, polypeptides of the invention comprise the sequence: 

AQRKKEMVLSEKVSQLMEW

LQLHRQCWCKQADEEFQILANSWRYSSA 
NSAPTEtNFPAKGKPKRGDTYELQVR 

ARWRFWCVSVT (SEQ ID NO:765); MVVALLIVCDVPSAS (SEQ ID NO:766); 
AQRKKEMVLSEKVSQL (SEQ ID NO:767); MEWTNKRPVIRMNGDKF (SEQ
ID NO:768); RRLVKAPPRNYSVIVM^

SSAFTNRIFFA (SEQ ID NO:769); MVDFDEGSDVFQMLNMNS APTFINFPAK 
GKP (SEQ ID NO:770); KRGDTYELQVRGFSAEQLARWLADRTDVNIRVIRPPN-
(SEQ ID NO:771); and/or YAGPLMLGLLLAVIGGLV\1.RRVTWNFSLIKLDGLLQL
CVLCLL (SEQ ID NO:772). Polynucleotides encoding these polypeptides are also
encompassed by the invention.

This gene is expressed primarily in infant adrenal gland and prostate cell line and to
a lesser extent in a few other tissues such as liver, smooth muscle, etc.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, prostate cancer and endocrine disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the prostate and adrenal gland, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 423 as residues: Pro-34 to Gly-43, Arg-113 to Pro-120.

The tissue distribution and homology to N33 indicate that polynucleotides and
polypeptides corresponding to this gene are useful for diagnosis and treatment of
prostate cancer and endocrine disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 191 

This gene is expressed primarily in T cells and to a lesser extent in fetal lung.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder. Preferred epitopes include
those comprising a sequence shown in SEQ ID NO: 424 as residues: Trp-3 to Phe-9.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of immune disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 192 

This gene maps to chromosome 6 and therefore polynucleotides of the invention
can be used in linkage analysis as a marker for chromosome 6. Neural activity and
neurotrophins induce synaptic remodeling in part by altering gene expression. The
product of this gene is believed to be a glycosylphosphatidylinositol-anchored protein
encoded by a hippocampal gene and to possess neural activity. This molecule is believed
to be expressed in postmitotic-differentiating neurons of the developing nervous system
and neuronal structures associated with plasticity in the adult. Message of this gene is
believed to be induced by neuronal activity and by the activity-regulated neurotrophins
BDNF and NT-3. The product of this gene is believed to stimulate neurite outgrowth
and arborization in primary embryonic hippocampal and cortical cultures and to act as a
downstream effector of activity-induced neurite outgrowth. In specific embodiments,
polypeptides of the invention comprise the sequence: DAVFKGFSDCLLKLGDS (SEQ 
ID NO:773); CQEGAKDMWDKLRKESKNLN (SEQ ID NO:774); 

VLLVSLSAALATWLSF (SEQ ID NO:775); MGLKLNGRYISLILA VQIAYLVQAVR
AAGKCDAVFKGFSDCLLKLGDS (SEQ ID NO:776); PAAWDDKTNIKTVCTW

EDFHSCTVTALTDCQEGAKD^VDKXRKESKiNXNIQGSLFELCGSGNGAAGSL 
LP AFP VLLVSLSAALATWLSF (SEQ ID NO: 777); and/or MGLKLNGRYISLILA 

VQL^YLVQAVR.^GKCDAVFKGFSDCU.KLGDSXXXXXPAAWDDKTNIKTVC 
TYWEDFHSCTVTALTDCQEGAKDMW^

GSLLP AFP VLLVSLSAALATWLSF (SEQ ID NO:778). Polynucleotides encoding
these polypeptides are also encompassed by the invention.

This gene is expressed primarily in human placenta, endometrial tumor and 
tissues of the central nervous system (CNS). 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, reproductive disorders, cancers and neurological diseases.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
reproductive and nervous systems, expression of this gene at significantly higher
or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder. Preferred epitopes include
those comprising a sequence shown in SEQ ID NO: 425 as residues: Asp-47 to Asp-63,
His-75 to Tyr-80, Pro-83 to Tyr-89.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of reproductive
disorders such as endometrial tumors. Expression of this gene in tissues of the CNS
and its strong homology to Neuritin suggest that the protein product from this gene may
also be used in the treatment and diagnosis of neurological disorders and in the
regeneration of neural tissues, e.g., following injury.

FEATURES OF PROTEIN ENCODED BY GENE NO: 193

The translation product of this gene shares sequence homology with tenascin,
which is thought to be important in development. The translation product of this gene is
believed to be a ligand of the fibroblast growth factor family. FGF ligand activity is
known in the art and can be assayed by methods known in the art and disclosed
elsewhere herein.

This gene is expressed primarily in endometrial tumors, and other types of
tumors.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancers. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the cancer tissues, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder. Preferred epitopes include those
comprising a sequence shown in SEQ ID NO: 426 as residues: Gly-29 to Glu-34,
Arg-71 to Arg-76, Thr-176 to Cys-182, Gly-184 to Glu-199.

The tissue distribution and homology to tenascin indicate that polynucleotides
and polypeptides corresponding to this gene are useful for diagnosis and treatment of
cancers.

FEATURES OF PROTEIN ENCODED BY GENE NO: 194

In specific embodiments, polypeptides of the invention comprise the sequence: 
MNSAAGFSHLDRRERVLKLGESFEKQPRCASTLC (SEQ ID NO:779). 
Polynucleotides encoding these polypeptides are also encompassed by the invention.

This gene is expressed primarily in fetal human lung and neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, lung development and respiratory disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the respiratory system, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution in fetal lung and neutrophils indicates that
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis
and treatment of lung and immunity related diseases, for example, lung cancer, viral,
fungal or bacterial infections (e.g., lesions caused by tuberculosis), inflammation (e.g.,
pneumonia), metabolic lesions, etc.

FEATURES OF PROTEIN ENCODED BY GENE NO: 195 

This gene is expressed primarily in breast lymph node.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and treatment of immune
disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 196

This gene maps to chromosome 5 and accordingly, polynucleotides of the invention can
be used in linkage analysis as a marker for chromosome 5. The translation product of
this gene shares sequence homology with human M-phase phosphoprotein 4, which is
thought to be important in phosphorylation and signal transduction processes. In
specific embodiments, polypeptides of the invention comprise the sequence:
TIYPTEEELQAVQKIVSITERALKLVSD (SEQ ID NO:780); RALKGVLRV
GVLAKGLLLRGDRNVNLVLLC (SEQ ID NO:781); ALAALRHAKWFQARAN
GLQSCVinRILRDLCQRVPTWS (SEQ ID NO:782); GDALRRVFECISSGIIL (SEQ
ID NO:783); LAFRQIHKVLGMDPLP (SEQ ID NO:784); and/or TIYPTEEELQAVQ

KIVSITERALKLVSDSLSEHEKNKNKEGDDKKEGGKDR.^KGVLRVGVLAKG
LLLRGDRNVNLVLLCSEK^SKTLLSRL^ENLPKQLAVISPEKYDIKCAVSEAAIIL 
NSCVTEPKMQVTITLTSP^ 
HAKWFQARANGLQSCVfflRILRDLCQRV^ 

QSPGDALRRVFECISSGIILKGSPGLLDPCEKDPFDTLATMTDQQREDrrSSAQFA 
LRLLAFRQIHKVLGMDPLPQMSQRFNIHNNRKRRRDSDGVDGFEAEGKKDKK
DYDNF (SEQ ID NO:785). Polynucleotides encoding these polypeptides are also
encompassed by the invention.

This gene is expressed primarily in human hippocampus and to a lesser extent
in prostate and human frontal cortex.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, disorders related to the reproductive system and nervous system. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the reproductive
system and nervous system, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., cancerous and wounded
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution and homology to human M-phase phosphoprotein 4
indicate that polynucleotides and polypeptides corresponding to this gene are useful for
the diagnosis and treatment of reproductive and nervous system disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 197

In specific embodiments, polypeptides of the invention comprise the sequence:
MGSQHSAAARPSSCRRKQEDDRDG (SEQ ID NO:786);
L1AEREQEEAIAQFPYVEFTGRDSITCLTC (SEQ ID NO:787); and/or
QGTGYIPTEQVNELVALIPHSDQRLRPQRTKQYV (SEQ ID NO:788).
Polynucleotides encoding these polypeptides are also encompassed by the invention.

This gene is expressed primarily in Human Primary Breast Cancer and to a
lesser extent in Human Adult Spleen, Hodgkin's Lymphoma I, and Salivary Gland.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, cancer and immune disorders. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the cancer and immune system, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma,
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 430 as residues: Ser-126 to Gly-138.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and treatment of cancer and
immune disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 198

This gene is expressed primarily in monocytes.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, blood cell disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and treatment of blood cell
disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 199 

This gene is expressed primarily in Human Ovary and Synovia and to a lesser
extent in Human 8 Week Whole Embryo.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, reproductive and developmental disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the reproductive and developmental system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and treatment of reproductive
and developmental disorders.

FEATURES OF PROTEIN ENCODED BY GENE NO: 200

This gene maps to chromosome 8 and therefore polynucleotides of the invention
can be used in linkage analysis as a marker for chromosome 8. The translation product
of this gene shares limited sequence homology with the collagen proline-rich domain.
This gene is expressed primarily in the CNS.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, neurological diseases. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the nervous system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder. Preferred
epitopes include those comprising a sequence shown in SEQ ID NO: 433 as residues:
Pro-35 to Asp-41.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of neurological
diseases.

FEATURES OF PROTEIN ENCODED BY GENE NO: 201

The translation product of this gene shares homology with a mammalian histone
H1a protein. One embodiment for this gene is the polypeptide fragments comprising the
following amino acid sequences: ARLNVGRESLKREMLKSQGVKVSESPMGAR
HSSWPEGAAFCKKVQGAQMQFPPRR (SEQ ID NO:789); ARLNVGRESLKR
EML (SEQ ID NO:790); LKSQGVKVSESPMGARHSSW (SEQ ID NO:791);
AFCKKVQGAQMQFPPRR (SEQ ID NO:792). An additional embodiment is the
polynucleotide fragments encoding these polypeptide fragments (see Accession No.
pir|S24178).

This gene is expressed primarily in neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of immune disorders.
Since the gene is expressed in cells of lymphoid origin, the natural gene product may be
involved in vital immune functions. Therefore it may also be used as an agent for
immunological disorders including arthritis, asthma, immune deficiency diseases such
as AIDS, and leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 202 
This gene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues (e.g.,
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial
fluid or spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of immune disorders.
Since the gene is expressed in cells of lymphoid origin, the natural gene product may be
involved in immune functions. Therefore it may also be used as an agent for
immunological disorders including arthritis, asthma, immune deficiency diseases such
as AIDS, and leukemia.

FEATURES OF PROTEIN ENCODED BY GENE NO: 203
This gene is expressed primarily in Neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, infectious disorders, immune disorders, and cancers. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 436 as residues: Thro I to Lys-36.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for diagnosis and treatment of infectious
disorders, immune disorders, and cancers. Since the gene is expressed in cells of
lymphoid origin, the natural gene product may be involved in immune functions.
Therefore it may also be used as an agent for immunological disorders including
arthritis, asthma, immune deficiency diseases such as AIDS, and leukemia. The protein,
as well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above listed tumors and tissues.

FEATURES OF PROTEIN ENCODED BY GENE NO: 204 

This gene maps to chromosome 16 and therefore polynucleotides of the
invention can be used in linkage analysis as markers for chromosome 16. The
translation product of this gene shares sequence homology with lactate dehydrogenase,
which is thought to be important in lactate metabolism.

This gene is expressed primarily in human tonsils and to a lesser extent in
Spleen and Neutrophils.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, immune disorders, infectious disorders, and cancers. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune
disorders, infectious disorders, and cancers, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal
fluid) or another tissue or cell sample taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy tissue
or bodily fluid from an individual not having the disorder. Preferred epitopes include
those comprising a sequence shown in SEQ ID NO: 437 as residues: Gly-7 to Ser-12.

The tissue distribution and homology to the lactate dehydrogenase gene indicate
that polynucleotides and polypeptides corresponding to this gene are useful for
diagnosis and treatment of immune disorders, infectious disorders, and cancers.

FEATURES OF PROTEIN ENCODED BY GENE NO: 205

The translation product of this gene shares sequence homology with Gcap1
protein, which is developmentally regulated in brain.

This gene is expressed primarily in placenta and endometrial tumor and to a
lesser extent in several other tumors.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, vasculogenesis/angiogenesis and tumorigenesis. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the vascular system and tumors,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution and homology to Gcap1 protein indicate that
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis
and treatment of disorders or dysfunction of the vascular system or of tumorigenesis.

FEATURES OF PROTEIN ENCODED BY GENE NO: 206

In specific embodiments, polypeptides of the invention comprise the sequence:
MPYAQWLAENDRJFEEAQKAFHKAGRQREA (SEQ ID NO:799);
VQVLEQLTNNAV^SRFNDAi^YYT^ILSMQCLDIAQD (SEQ ID NO:794);
PAQKDTMLGKFYHFQRLAELYHGYHAIHRHTEDP (SEQ ID NO:795);
FSVHRPETLFNISRFLLHSLPKDTPSGISKVKILFT (SEQ ID NO:800);
LAKQSKALGAYRLARHAYDKLRGLYIP (SEQ ID NO:796); ARFQICSELG
TLTIRAKPFHDSEELVPLCYRCSTNN (SEQ ID NO:797); and/or PLLNNLGNVC
INCRQPFIFSASSYDVLHLVEFYI^EGITDEEAISL^
LPDSCG (SEQ ID NO:798). Polynucleotides encoding these polypeptides are also
encompassed by the invention.

This gene is expressed primarily in testes. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, male reproductive and endocrine disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the reproductive and endocrine systems,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum,
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of male reproductive and endocrine 
disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 207
This gene is expressed in fetal lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions, which include, but are
not limited to, lung diseases such as cystic fibrosis. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the respiratory system, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID
NO: 440 as residues: Tyr-49 to Cys-54.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for detection and treatment of disorders associated
with developing lungs, particularly in premature infants where the lungs are the last
tissues to develop. The tissue distribution also indicates that polynucleotides and
polypeptides corresponding to this gene are useful for diagnosis and intervention of
lung tumors, since the gene may be involved in the regulation of cell division,
particularly since it is expressed in fetal tissue. The protein, as well as antibodies directed
against the protein, may show utility as a tumor marker and immunotherapy target for
the above listed tumors and tissues.

Table 1 summarizes the information corresponding to each "Gene No." described
above. The nucleotide sequence identified as "NT SEQ ID NO:X" was assembled from
partially homologous ("overlapping") sequences obtained from the "cDNA clone ID"
identified in Table 1 and, in some cases, from additional related DNA clones. The
overlapping sequences were assembled into a single contiguous sequence of high
redundancy (usually three to five overlapping sequences at each nucleotide position),
resulting in a final sequence identified as SEQ ID NO:X.

The cDNA Clone ID was deposited on the date and given the corresponding
deposit number listed in "ATCC Deposit No:Z and Date." Some of the deposits contain
multiple different clones corresponding to the same gene. "Vector" refers to the type of
vector contained in the cDNA Clone ID.

"Total NT Seq." refers to the total number of nucleotides in the contig identified
by "Gene No." The deposited clone may contain all or most of these sequences,
reflected by the nucleotide position indicated as "5' NT of Clone Seq." and the "3' NT
of Clone Seq." of SEQ ID NO:X. The nucleotide position of SEQ ID NO:X of the
putative start codon (methionine) is identified as "5' NT of Start Codon." Similarly,
the nucleotide position of SEQ ID NO:X of the predicted signal sequence is identified as
"5' NT of First AA of Signal Pep."

The translated amino acid sequence, beginning with the methionine, is identified
as "AA SEQ ID NO:Y," although other reading frames can also be easily translated
using known molecular biology techniques. The polypeptides produced by these
alternative open reading frames are specifically contemplated by the present invention.
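
The translation of alternative reading frames mentioned above can be illustrated with a minimal sketch. The function names and the short test sequence are hypothetical and are not taken from Table 1 or the sequence listing; the sketch assumes the standard genetic code, covers only the three forward frames, and marks stop codons with "*".

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, frame: int = 0) -> str:
    """Translate a DNA string in forward frame 0, 1, or 2.

    Stop codons are shown as '*'; codons with unknown bases become 'X'.
    """
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def forward_frames(dna: str) -> dict:
    """Return the translation of each of the three forward reading frames."""
    return {frame: translate(dna, frame) for frame in range(3)}


if __name__ == "__main__":
    # Hypothetical 9-nucleotide example, not a sequence of the invention.
    for frame, protein in forward_frames("ATGGCCTAA").items():
        print(frame, protein)
```

Translating the reverse complement in the same way would give the remaining three frames.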

The first and last amino acid position of SEQ ID NO:Y of the predicted signal
peptide is identified as "First AA of Sig Pep" and "Last AA of Sig Pep." The predicted
first amino acid position of SEQ ID NO:Y of the secreted portion is identified as
"Predicted First AA of Secreted Portion." Finally, the amino acid position of SEQ ID
NO:Y of the last amino acid in the open reading frame is identified as "Last AA of
ORF."

SEQ ID NO:X and the translated SEQ ID NO:Y are sufficiently accurate and
otherwise suitable for a variety of uses well known in the art and described further
below. For instance, SEQ ID NO:X is useful for designing nucleic acid hybridization
probes that will detect nucleic acid sequences contained in SEQ ID NO:X or the cDNA
contained in the deposited clone. These probes will also hybridize to nucleic acid
molecules in biological samples, thereby enabling a variety of forensic and diagnostic
methods of the invention. Similarly, polypeptides identified from SEQ ID NO:Y may
be used to generate antibodies which bind specifically to the secreted proteins encoded
by the cDNA clones identified in Table 1.

Nevertheless, DNA sequences generated by sequencing reactions can contain
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or
deleted nucleotides cause frame shifts in the reading frames of the predicted amino acid
sequence. In these cases, the predicted amino acid sequence diverges from the actual
amino acid sequence, even though the generated DNA sequence may be greater than
99.9% identical to the actual DNA sequence (for example, one base insertion or deletion
in an open reading frame of over 1000 bases).
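
The effect of a single erroneously inserted base on the predicted amino acid sequence can be sketched as follows. The sequence and the helper names are hypothetical and chosen only for illustration; the sketch assumes the standard genetic code and marks stop codons with "*".

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base; '*' marks a stop codon."""
    codons = (dna[i:i + 3] for i in range(0, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def insert_base(dna: str, position: int, base: str) -> str:
    """Return dna with one base inserted before the given 0-based position."""
    return dna[:position] + base + dna[position:]


def first_divergence(a: str, b: str) -> int:
    """Index of the first differing residue, or -1 if one is a prefix of the other."""
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    return -1


if __name__ == "__main__":
    actual = "ATGGCTGAAAAGCTTTGA"               # hypothetical open reading frame
    generated = insert_base(actual, 6, "C")     # one spurious base in the read
    print(translate(actual))      # MAEKL*
    print(translate(generated))   # MARKAL
    print(first_divergence(translate(actual), translate(generated)))  # 2
```

Every residue downstream of the insertion changes, and in this example the stop codon is lost, although the two DNA strings differ by a single base.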

Accordingly, for those applications requiring precision in the nucleotide
sequence or the amino acid sequence, the present invention provides not only the
generated nucleotide sequence identified as SEQ ID NO:X and the predicted translated
amino acid sequence identified as SEQ ID NO:Y, but also a sample of plasmid DNA
containing a human cDNA of the invention deposited with the ATCC, as set forth in
Table 1. The nucleotide sequence of each deposited clone can readily be determined by
sequencing the deposited clone in accordance with known methods. The predicted
amino acid sequence can then be verified from such deposits. Moreover, the amino
acid sequence of the protein encoded by a particular clone can also be directly
determined by peptide sequencing or by expressing the protein in a suitable host cell
containing the deposited human cDNA, collecting the protein, and determining its
sequence.

The present invention also relates to the genes corresponding to SEQ ID NO:X,
SEQ ID NO:Y, or the deposited clone. The corresponding gene can be isolated in
accordance with known methods using the sequence information disclosed herein.
Such methods include preparing probes or primers from the disclosed sequence and
identifying or amplifying the corresponding gene from appropriate sources of genomic
material.

Also provided in the present invention are species homologs. Species
homologs may be isolated and identified by making suitable probes or primers from the
sequences provided herein and screening a suitable nucleic acid source for the desired
homologue.

The polypeptides of the invention can be prepared in any suitable manner. Such
polypeptides include isolated naturally occurring polypeptides, recombinantly produced
polypeptides, synthetically produced polypeptides, or polypeptides produced by a
combination of these methods. Means for preparing such polypeptides are well
understood in the art.

The polypeptides may be in the form of the secreted protein, including the
mature form, or may be a part of a larger protein, such as a fusion protein (see below).
It is often advantageous to include an additional amino acid sequence which contains
secretory or leader sequences, pro-sequences, sequences which aid in purification,
such as multiple histidine residues, or an additional sequence for stability during
recombinant production.

The polypeptides of the present invention are preferably provided in an isolated
form, and preferably are substantially purified. A recombinantly produced version of a
polypeptide, including the secreted polypeptide, can be substantially purified by the
one-step method described in Smith and Johnson, Gene 67:31-40 (1988).
Polypeptides of the invention also can be purified from natural or recombinant sources
using antibodies of the invention raised against the secreted protein in methods which
are well known in the art.

Signal Sequences 

Methods for predicting whether a protein has a signal sequence, as well as the
cleavage point for that sequence, are available. For instance, the method of McGeoch,
Virus Res. 3:271-286 (1985), uses the information from a short N-terminal charged
region and a subsequent uncharged region of the complete (uncleaved) protein. The
method of von Heijne, Nucleic Acids Res. 14:4683-4690 (1986), uses the information
from the residues surrounding the cleavage site, typically residues -13 to +2, where +1
indicates the amino terminus of the secreted protein. The accuracy of predicting the
cleavage points of known mammalian secretory proteins for each of these methods is in
the range of 75-80% (von Heijne, supra). However, the two methods do not always
produce the same predicted cleavage point(s) for a given protein.

In the present case, the deduced amino acid sequence of the secreted polypeptide
was analyzed by a computer program called SignalP (Henrik Nielsen et al., Protein
Engineering 10:1-6 (1997)), which predicts the cellular location of a protein based on
the amino acid sequence. As part of this computational prediction of localization, the
methods of McGeoch and von Heijne are incorporated. The analysis of the amino acid
sequences of the secreted proteins described herein by this program provided the results
shown in Table 1.

As one of ordinary skill would appreciate, however, cleavage sites sometimes
vary from organism to organism and cannot be predicted with absolute certainty.
Accordingly, the present invention provides secreted polypeptides having a sequence
shown in SEQ ID NO:Y which have an N-terminus beginning within 5 residues (i.e., +
or - 5 residues) of the predicted cleavage point. Similarly, it is also recognized that in
some cases, cleavage of the signal sequence from a secreted protein is not entirely
uniform, resulting in more than one secreted species. These polypeptides, and the
polynucleotides encoding such polypeptides, are contemplated by the present invention.
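
The set of secreted forms whose N-terminus lies within 5 residues of a predicted cleavage point can be enumerated as in the following sketch. The function name, the 1-based position convention, and the example precursor sequence are assumptions made for illustration; they do not correspond to any entry in Table 1.

```python
def n_terminal_variants(precursor: str, predicted_first_aa: int, window: int = 5) -> dict:
    """Enumerate secreted forms starting within +/- window residues of a prediction.

    predicted_first_aa is the 1-based position of the predicted first residue of
    the secreted portion. The result maps each offset (-window..+window) to the
    polypeptide that begins at that shifted position; start positions falling
    outside the precursor are skipped.
    """
    variants = {}
    for offset in range(-window, window + 1):
        start = predicted_first_aa + offset
        if 1 <= start <= len(precursor):
            variants[offset] = precursor[start - 1:]
    return variants


if __name__ == "__main__":
    # Hypothetical 20-residue precursor with a predicted secreted portion from residue 11.
    precursor = "MKLLLLAAAGSPTEQVNELK"
    for offset, secreted in n_terminal_variants(precursor, 11).items():
        print(f"{offset:+d}", secreted)
```

An offset of 0 reproduces the predicted secreted portion; negative offsets retain residues of the predicted signal peptide, and positive offsets remove additional residues.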

Moreover, the signal sequence identified by the above analysis may not
necessarily predict the naturally occurring signal sequence. For example, the naturally
occurring signal sequence may be further upstream from the predicted signal sequence.
However, it is likely that the predicted signal sequence will be capable of directing the
secreted protein to the ER. These polypeptides, and the polynucleotides encoding such
polypeptides, are contemplated by the present invention.

Polynucleotide and Polypeptide Variants

"Variant" refers to a polynucleotide or polypeptide differing from the 
polynucleotide or polypeptide of the present invention, but retaining essential properties 
thereof. Generally, variants are overall closely similar, and, in many regions, identical 
to the polynucleotide or polypeptide of the present invention. 

By a polynucleotide having a nucleotide sequence at least, for example, 95%
"identical" to a reference nucleotide sequence of the present invention, it is intended that
the nucleotide sequence of the polynucleotide is identical to the reference sequence
except that the polynucleotide sequence may include up to five point mutations per each
100 nucleotides of the reference nucleotide sequence encoding the polypeptide. In other
words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to
a reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence
may be deleted or substituted with another nucleotide, or a number of nucleotides up to
5% of the total nucleotides in the reference sequence may be inserted into the reference
sequence. The query sequence may be an entire sequence shown in Table 1, the ORF
(open reading frame), or any fragment specified as described herein.
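
The "up to five point mutations per each 100 nucleotides" criterion can be expressed as a simple check on a pair of already aligned sequences. The sketch below is a simplification under stated assumptions: the two strings are pre-aligned to equal length with "-" marking a gap, every substitution, deletion, or inserted nucleotide counts as one difference, and the function names are hypothetical. It is not the FASTDB calculation.

```python
def count_differences(aligned_ref: str, aligned_query: str) -> int:
    """Count aligned columns where the two sequences differ (gaps included)."""
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    return sum(1 for r, q in zip(aligned_ref, aligned_query) if r != q)


def meets_identity(aligned_ref: str, aligned_query: str, threshold_pct: float = 95.0) -> bool:
    """True if the differences do not exceed (100 - threshold)% of the reference length."""
    ref_len = sum(1 for r in aligned_ref if r != "-")  # gap columns are not reference bases
    allowed = ref_len * (100.0 - threshold_pct) / 100.0
    return count_differences(aligned_ref, aligned_query) <= allowed


if __name__ == "__main__":
    reference = "ACGT" * 25                  # hypothetical 100-nucleotide reference
    query = list(reference)
    for i in range(0, 20, 4):                # substitute five of the A residues
        query[i] = "T"
    print(meets_identity(reference, "".join(query)))   # five differences: within 95%
    query[20] = "T"                          # a sixth substitution
    print(meets_identity(reference, "".join(query)))   # six differences: below 95%
```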

As a practical matter, whether any particular nucleic acid molecule or
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide
sequence of the present invention can be determined conventionally using known
computer programs. A preferred method for determining the best overall match between
a query sequence (a sequence of the present invention) and a subject sequence, also
referred to as a global sequence alignment, can be determined using the FASTDB
computer program based on the algorithm of Brutlag et al. (Comp. App. Biosci. (1990)
6:237-245). In a sequence alignment the query and subject sequences are both DNA
sequences. An RNA sequence can be compared by converting U's to T's. The result
of said global sequence alignment is in percent identity. Preferred parameters used in a
FASTDB alignment of DNA sequences to calculate percent identity are:
Matrix=Unitary, k-tuple=4, Mismatch Penalty=1, Joining Penalty=30, Randomization
Group Length=0, Cutoff Score=1, Gap Penalty=5, Gap Size Penalty 0.05, Window
Size=500 or the length of the subject nucleotide sequence, whichever is shorter.

If the subject sequence is shorter than the query sequence because of 5' or 3'
deletions, not because of internal deletions, a manual correction must be made to the
results. This is because the FASTDB program does not account for 5' and 3'
truncations of the subject sequence when calculating percent identity. For subject
sequences truncated at the 5' or 3' ends, relative to the query sequence, the percent
identity is corrected by calculating the number of bases of the query sequence that are 5'
and 3' of the subject sequence, which are not matched/aligned, as a percent of the total
bases of the query sequence. Whether a nucleotide is matched/aligned is determined by
results of the FASTDB sequence alignment. This percentage is then subtracted from
the percent identity, calculated by the above FASTDB program using the specified
parameters, to arrive at a final percent identity score. This corrected score is what is
used for the purposes of the present invention. Only bases outside the 5' and 3' bases
of the subject sequence, as displayed by the FASTDB alignment, which are not
matched/aligned with the query sequence, are calculated for the purposes of manually
adjusting the percent identity score.

For example, a 90 base subject sequence is aligned to a 100 base query
sequence to determine percent identity. The deletions occur at the 5' end of the subject
sequence and therefore, the FASTDB alignment does not show a match/alignment of
the first 10 bases at the 5' end. The 10 unpaired bases represent 10% of the sequence
(number of bases at the 5' and 3' ends not matched/total number of bases in the query
sequence) so 10% is subtracted from the percent identity score calculated by the
FASTDB program. If the remaining 90 bases were perfectly matched the final percent
identity would be 90%. In another example, a 90 base subject sequence is compared
with a 100 base query sequence. This time the deletions are internal deletions so that
there are no bases on the 5' or 3' of the subject sequence which are not matched/aligned
with the query. In this case the percent identity calculated by FASTDB is not manually
corrected. Once again, only bases 5' and 3' of the subject sequence which are not
matched/aligned with the query sequence are manually corrected for. No other manual
corrections are to be made for the purposes of the present invention.
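
The manual correction described above amounts to a single subtraction. The following Python sketch is illustrative only: the function and parameter names are hypothetical, and it assumes that the percent identity reported by the alignment program and the counts of unaligned terminal query positions have already been read off the alignment. It does not call or emulate FASTDB itself.

```python
def corrected_percent_identity(reported_identity, query_length,
                               unaligned_5prime, unaligned_3prime):
    """Subtract the terminal-truncation penalty from a reported percent identity.

    reported_identity -- percent identity (0-100) reported by the alignment program
    query_length      -- total number of bases (or residues) in the query sequence
    unaligned_5prime  -- query positions 5' (or N-terminal) of the subject that are
                         not matched/aligned
    unaligned_3prime  -- query positions 3' (or C-terminal) of the subject that are
                         not matched/aligned
    """
    if query_length <= 0:
        raise ValueError("query_length must be positive")
    overhang = unaligned_5prime + unaligned_3prime
    if unaligned_5prime < 0 or unaligned_3prime < 0 or overhang > query_length:
        raise ValueError("unaligned counts must be non-negative and fit in the query")
    # Unaligned terminal positions, expressed as a percent of the query length.
    penalty = 100.0 * overhang / query_length
    return reported_identity - penalty


# First example in the text: 10 unaligned bases at the 5' end of a 100 base query,
# remaining 90 bases perfectly matched.
terminal_case = corrected_percent_identity(100.0, 100, 10, 0)

# Second example: internal deletions only, so no terminal positions are unaligned
# and the reported value (88.0 here is an arbitrary placeholder) is left unchanged.
internal_case = corrected_percent_identity(88.0, 100, 0, 0)
```

Internal gaps never enter the calculation; only the two terminal counts do, which mirrors the statement that no other manual corrections are made.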

By a polypeptide having an amino acid sequence at least, for example, 95%
"identical" to a query amino acid sequence of the present invention, it is intended that
the amino acid sequence of the subject polypeptide is identical to the query sequence
except that the subject polypeptide sequence may include up to five amino acid
alterations per each 100 amino acids of the query amino acid sequence. In other words,
to obtain a polypeptide having an amino acid sequence at least 95% identical to a query
amino acid sequence, up to 5% of the amino acid residues in the subject sequence may
be inserted or deleted (indels) or substituted with another amino acid. These alterations
of the reference sequence may occur at the amino or carboxy terminal positions of the
reference amino acid sequence or anywhere between those terminal positions,
interspersed either individually among residues in the reference sequence or in one or
more contiguous groups within the reference sequence.

As a practical matter, whether any particular polypeptide is at least 90%, 95%,
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences shown in
Table 1 or to the amino acid sequence encoded by the deposited DNA clone can be
determined conventionally using known computer programs. A preferred method for
determining the best overall match between a query sequence (a sequence of the present
invention) and a subject sequence, also referred to as a global sequence alignment,
uses the FASTDB computer program based on the algorithm of Brutlag
et al. (Comp. App. Biosci. (1990) 6:237-245). In a sequence alignment the query and
subject sequences are either both nucleotide sequences or both amino acid sequences.
The result of said global sequence alignment is in percent identity. Preferred parameters
used in a FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch
Penalty=1, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=1,
Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window
Size=500 or the length of the subject amino acid sequence, whichever is shorter.

If the subject sequence is shorter than the query sequence due to N- or C-
terminal deletions, not because of internal deletions, a manual correction must be made
to the results. This is because the FASTDB program does not account for N- and C-
terminal truncations of the subject sequence when calculating global percent identity.
For subject sequences truncated at the N- and C-termini, relative to the query
sequence, the percent identity is corrected by calculating the number of residues of the
query sequence that are N- and C-terminal of the subject sequence, which are not
matched/aligned with a corresponding subject residue, as a percent of the total residues of
the query sequence. Whether a residue is matched/aligned is determined by results of
the FASTDB sequence alignment. This percentage is then subtracted from the percent
identity, calculated by the above FASTDB program using the specified parameters, to
arrive at a final percent identity score. This final percent identity score is what is used
for the purposes of the present invention. Only residues to the N- and C-termini of the
subject sequence, which are not matched/aligned with the query sequence, are
considered for the purposes of manually adjusting the percent identity score. That is,
only query residue positions outside the farthest N- and C-terminal residues of the
subject sequence are considered.

For example, a 90 amino acid residue subject sequence is aligned with a 100
residue query sequence to determine percent identity. The deletion occurs at the N-
terminus of the subject sequence and therefore, the FASTDB alignment does not show
a matching/alignment of the first 10 residues at the N-terminus. The 10 unpaired
residues represent 10% of the sequence (number of residues at the N- and C-termini
not matched/total number of residues in the query sequence) so 10% is subtracted from
the percent identity score calculated by the FASTDB program. If the remaining 90
residues were perfectly matched the final percent identity would be 90%. In another
example, a 90 residue subject sequence is compared with a 100 residue query sequence.
This time the deletions are internal deletions so there are no residues at the N- or C-
termini of the subject sequence which are not matched/aligned with the query. In this
case the percent identity calculated by FASTDB is not manually corrected. Once again,
only residue positions outside the N- and C-terminal ends of the subject sequence, as
displayed in the FASTDB alignment, which are not matched/aligned with the query
sequence are manually corrected for. No other manual corrections are to be made for the
purposes of the present invention.

The variants may contain alterations in the coding regions, non-coding regions,
or both. Especially preferred are polynucleotide variants containing alterations which
produce silent substitutions, additions, or deletions, but do not alter the properties or
activities of the encoded polypeptide. Nucleotide variants produced by silent
substitutions due to the degeneracy of the genetic code are preferred. Moreover,
variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or added in any
combination are also preferred. Polynucleotide variants can be produced for a variety
of reasons, e.g., to optimize codon expression for a particular host (change codons in
the human mRNA to those preferred by a bacterial host such as E. coli).
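
A silent, host-directed codon change of the kind mentioned above can be sketched in a few lines of Python. This is a minimal illustration, not a method taken from the specification: the function names are hypothetical, and the two-entry preference map is an illustrative assumption rather than a measured codon-usage table. The standard genetic code is built from its conventional TCAG-ordered string.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (DNA alphabet).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Hypothetical preference map (amino acid -> codon to use in the host).
# Illustrative only; a real design would use a codon-usage table for the host.
HYPOTHETICAL_PREFERRED = {"R": "CGT", "L": "CTG"}


def translate(dna):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode_codons(dna, preferred):
    """Swap codons for preferred synonyms without changing the encoded protein."""
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    out = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        replacement = preferred.get(amino_acid, codon)
        # Guard against a preference map that would alter the protein.
        if CODON_TABLE[replacement] != amino_acid:
            raise ValueError("preferred codon does not encode the same amino acid")
        out.append(replacement)
    return "".join(out)


original = "ATGAGATTATAA"  # Met-Arg-Leu-stop
recoded = recode_codons(original, HYPOTHETICAL_PREFERRED)
```

Because every replacement is checked against the codon table, the recoded sequence translates to the same polypeptide as the original, which is what makes the substitutions silent.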

Naturally occurring variants are called "allelic variants," and refer to one of
several alternate forms of a gene occupying a given locus on a chromosome of an
organism. (Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These
allelic variants can vary at the polynucleotide level, the polypeptide level, or both.

Alternatively, non-naturally occurring variants may be produced by mutagenesis
techniques or by direct synthesis.

Using known methods of protein engineering and recombinant DNA
technology, variants may be generated to improve or alter the characteristics of the
polypeptides of the present invention. For instance, one or more amino acids can be
deleted from the N-terminus or C-terminus of the secreted protein without substantial
loss of biological function. The authors of Ron et al., J. Biol. Chem. 268: 2984-2988
(1993), reported variant KGF proteins having heparin binding activity even after
deleting 3, 8, or 27 amino-terminal amino acid residues. Similarly, Interferon gamma
exhibited up to ten times higher activity after deleting 8-10 amino acid residues from the
carboxy terminus of this protein. (Dobeli et al., J. Biotechnology 7:199-216 (1988).)

Moreover, ample evidence demonstrates that variants often retain a biological
activity similar to that of the naturally occurring protein. For example, Gayle and
coworkers (J. Biol. Chem. 268:22105-22111 (1993)) conducted extensive mutational
analysis of human cytokine IL-1a. They used random mutagenesis to generate over
3,500 individual IL-1a mutants that averaged 2.5 amino acid changes per variant over
the entire length of the molecule. Multiple mutations were examined at every possible
amino acid position. The investigators found that "[m]ost of the molecule could be
altered with little effect on either [binding or biological activity]." (See, Abstract.) In
fact, only 23 unique amino acid sequences, out of more than 3,500 nucleotide
sequences examined, produced a protein that significantly differed in activity from wild-
type.

Furthermore, even if deleting one or more amino acids from the N-terminus or
C-terminus of a polypeptide results in modification or loss of one or more biological
functions, other biological activities may still be retained. For example, the ability of a
deletion variant to induce and/or to bind antibodies which recognize the secreted form
will likely be retained when less than the majority of the residues of the secreted form
are removed from the N-terminus or C-terminus. Whether a particular polypeptide
lacking N- or C-terminal residues of a protein retains such immunogenic activities can
readily be determined by routine methods described herein and otherwise known in the
art.

Thus, the invention further includes polypeptide variants which show
substantial biological activity. Such variants include deletions, insertions, inversions,
repeats, and substitutions selected according to general rules known in the art so as
to have little effect on activity. For example, guidance concerning how to make
phenotypically silent amino acid substitutions is provided in Bowie, J. U. et al.,
Science 247:1306-1310 (1990), wherein the authors indicate that there are two main
strategies for studying the tolerance of an amino acid sequence to change.

The first strategy exploits the tolerance of amino acid substitutions by natural
selection during the process of evolution. By comparing amino acid sequences in
different species, conserved amino acids can be identified. These conserved amino
acids are likely important for protein function. In contrast, the amino acid positions
where substitutions have been tolerated by natural selection are likely not
critical for protein function. Thus, positions tolerating amino acid
substitution could be modified while still maintaining biological activity of the protein.

The second strategy uses genetic engineering to introduce amino acid changes at
specific positions of a cloned gene to identify regions critical for protein function. For
example, site directed mutagenesis or alanine-scanning mutagenesis (introduction of
single alanine mutations at every residue in the molecule) can be used. (Cunningham
and Wells, Science 244:1081-1085 (1989).) The resulting mutant molecules can then
be tested for biological activity.
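
The set of single-alanine variants that an alanine scan would call for can be enumerated directly from the amino acid sequence. The sketch below is illustrative: the function name is hypothetical, sequences are plain one-letter strings, and skipping residues that are already alanine is an assumption made here, not something stated in the text.

```python
def alanine_scan(protein):
    """Yield (position, original_residue, mutant_sequence) for each single
    alanine substitution. Positions are 1-based. Residues that are already
    alanine are skipped (an assumption of this sketch)."""
    for index, residue in enumerate(protein):
        if residue == "A":
            continue
        yield index + 1, residue, protein[:index] + "A" + protein[index + 1:]


# Toy four-residue sequence, chosen only to keep the output short.
mutants = list(alanine_scan("MKAL"))
```

Each tuple names one construct to build and assay; the biological testing itself is outside the scope of the sketch.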

As the authors state, these two strategies have revealed that proteins are
surprisingly tolerant of amino acid substitutions. The authors further indicate which
amino acid changes are likely to be permissive at certain amino acid positions in the
protein. For example, most buried (within the tertiary structure of the protein) amino
acid residues require nonpolar side chains, whereas few features of surface side chains
are generally conserved. Moreover, tolerated conservative amino acid substitutions
involve replacement of the aliphatic or hydrophobic amino acids Ala, Val, Leu and Ile;
replacement of the hydroxyl residues Ser and Thr; replacement of the acidic residues
Asp and Glu; replacement of the amide residues Asn and Gln; replacement of the basic
residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr, and Trp;
and replacement of the small-sized amino acids Ala, Ser, Thr, Met, and Gly.
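
The conservative-substitution groups listed above lend themselves to a simple lookup. The following Python sketch encodes those seven groups in one-letter code and tests whether a given exchange stays within a group. The names are hypothetical, and treating an exchange as conservative whenever both residues share at least one group is an interpretive assumption.

```python
# Groups as listed in the text, in one-letter amino acid code.
CONSERVATIVE_GROUPS = [
    {"A", "V", "L", "I"},       # aliphatic or hydrophobic: Ala, Val, Leu, Ile
    {"S", "T"},                 # hydroxyl: Ser, Thr
    {"D", "E"},                 # acidic: Asp, Glu
    {"N", "Q"},                 # amide: Asn, Gln
    {"K", "R", "H"},            # basic: Lys, Arg, His
    {"F", "Y", "W"},            # aromatic: Phe, Tyr, Trp
    {"A", "S", "T", "M", "G"},  # small-sized: Ala, Ser, Thr, Met, Gly
]


def is_conservative(original, replacement):
    """Return True if the two different residues share at least one listed group."""
    if original == replacement:
        return False  # not a substitution at all
    return any(original in group and replacement in group
               for group in CONSERVATIVE_GROUPS)
```

For instance, `is_conservative("L", "I")` and `is_conservative("A", "G")` are true, while `is_conservative("D", "K")` is false because no listed group contains both an acidic and a basic residue.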

Besides conservative amino acid substitution, variants of the present invention
include (i) substitutions with one or more of the non-conserved amino acid residues,
where the substituted amino acid residues may or may not be one encoded by the
genetic code, or (ii) substitution with one or more of amino acid residues having a
substituent group, or (iii) fusion of the mature polypeptide with another compound,
such as a compound to increase the stability and/or solubility of the polypeptide (for
example, polyethylene glycol), or (iv) fusion of the polypeptide with additional amino
acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a
sequence facilitating purification. Such variant polypeptides are deemed to be within
the scope of those skilled in the art from the teachings herein.

For example, polypeptide variants containing amino acid substitutions of
charged amino acids with other charged or neutral amino acids may produce proteins
with improved characteristics, such as less aggregation. Aggregation of pharmaceutical
formulations both reduces activity and increases clearance due to the aggregate's
immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:331-340 (1967);
Robbins et al., Diabetes 36: 838-845 (1987); Cleland et al., Crit. Rev. Therapeutic
Drug Carrier Systems 10:307-377 (1993).)

Polynucleotide and Polypeptide Fragments

In the present invention, a "polynucleotide fragment" refers to a short
polynucleotide having a nucleic acid sequence contained in the deposited clone or
shown in SEQ ID NO:X. The short nucleotide fragments are preferably at least about
15 nt, and more preferably at least about 20 nt, still more preferably at least about 30 nt,
and even more preferably, at least about 40 nt in length. A fragment "at least 20 nt in
length," for example, is intended to include 20 or more contiguous bases from the
cDNA sequence contained in the deposited clone or the nucleotide sequence shown in
SEQ ID NO:X. These nucleotide fragments are useful as diagnostic probes and primers
as discussed herein. Of course, larger fragments (e.g., 50, 150, 500, 600, 2000
nucleotides) are preferred.

Moreover, representative examples of polynucleotide fragments of the
invention include, for example, fragments having a sequence from about nucleotide
number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400,
401-450, 451-500, 501-550, 551-600, 651-700, 701-750, 751-800, 800-850, 851-900,
901-950, 951-1000, 1001-1050, 1051-1100, 1101-1150, 1151-1200, 1201-1250,
1251-1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600,
1601-1650, 1651-1700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901-1950,
1951-2000, or 2001 to the end of SEQ ID NO:X or the cDNA contained in the
deposited clone. In this context "about" includes the particularly recited ranges, larger
or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini.
Preferably, these fragments encode a polypeptide which has biological activity. More
preferably, these polynucleotides can be used as probes or primers as discussed herein.

In the present invention, a "polypeptide fragment" refers to a short amino acid
sequence contained in SEQ ID NO:Y or encoded by the cDNA contained in the
deposited clone. Protein fragments may be "free-standing," or comprised within a
larger polypeptide of which the fragment forms a part or region, most preferably as a
single continuous region. Representative examples of polypeptide fragments of the
invention include, for example, fragments from about amino acid number 1-20, 21-40,
41-60, 61-80, 81-100, 102-120, 121-140, 141-160, or 161 to the end of the coding
region. Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90,
100, 110, 120, 130, 140, or 150 amino acids in length. In this context "about"
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1)
amino acids, at either extreme or at both extremes.
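
The recited fragment ranges, and the way "about" widens or narrows each one by up to five positions at either end, can be illustrated with a short Python sketch. The function names are hypothetical. The uniform tiling is a simplification: the ranges recited in the text are not a perfectly regular tiling, and the final open-ended range ("to the end") is represented here simply by clipping the last window to the sequence length.

```python
def fragment_ranges(seq_length, window=50):
    """Return 1-based inclusive (start, end) ranges tiling a sequence in
    consecutive windows; the last window is clipped to the sequence end."""
    return [(start, min(start + window - 1, seq_length))
            for start in range(1, seq_length + 1, window)]


def about(fragment, seq_length, slack=5):
    """Return every (start, end) obtained by moving either terminus of the
    fragment by up to `slack` positions in either direction, keeping the
    result inside the sequence and non-empty."""
    start, end = fragment
    variants = set()
    for start_shift in range(-slack, slack + 1):
        for end_shift in range(-slack, slack + 1):
            s, e = start + start_shift, end + end_shift
            if 1 <= s <= e <= seq_length:
                variants.add((s, e))
    return variants


ranges = fragment_ranges(120)            # 50-nt windows over a 120-nt sequence
near_51_100 = about((51, 100), 120)      # all ranges "about" 51-100
```

With `window=20` the same helper produces polypeptide-style ranges such as 1-20 and 21-40.
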
Preferred polypeptide fragments include the secreted protein as well as the
mature form. Further preferred polypeptide fragments include the secreted protein or
the mature form having a continuous series of deleted residues from the amino or the
carboxy terminus, or both. For example, any number of amino acids, ranging from
1-60, can be deleted from the amino terminus of either the secreted polypeptide or the
mature form. Similarly, any number of amino acids, ranging from 1-30, can be deleted
from the carboxy terminus of the secreted protein or mature form. Furthermore, any
combination of the above amino and carboxy terminus deletions is preferred.
Similarly, polynucleotide fragments encoding these polypeptide fragments are also
preferred.

Particularly, N-terminal deletions of the polypeptide of the present invention can
be described by the general formula m-p, where p is the total number of amino acids in
the polypeptide and m is an integer from 2 to (p-1), and where both of these integers (m
& p) correspond to the position of the amino acid residue identified in SEQ ID NO:Y.

Moreover, C-terminal deletions of the polypeptide of the present invention can
also be described by the general formula 1-n, where n is an integer from 2 to (p-1), and
again where these integers (n & p) correspond to the position of the amino acid residue
identified in SEQ ID NO:Y.

The invention also provides polypeptides having one or more amino acids
deleted from both the amino and the carboxyl termini, which may be described
generally as having residues m-n of SEQ ID NO:Y, where m and n are integers as
described above.
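
The m-p, 1-n, and m-n formulas can be made concrete by enumerating the fragments they describe for a given sequence. The Python sketch below uses hypothetical function names and a toy five-residue sequence. For the double deletions it additionally assumes m is not greater than n, which the text implies but does not state.

```python
def n_terminal_deletions(seq):
    """Fragments m-p for m = 2 .. p-1, keyed by (m, p) in 1-based positions."""
    p = len(seq)
    return {(m, p): seq[m - 1:] for m in range(2, p)}


def c_terminal_deletions(seq):
    """Fragments 1-n for n = 2 .. p-1, keyed by (1, n) in 1-based positions."""
    p = len(seq)
    return {(1, n): seq[:n] for n in range(2, p)}


def double_deletions(seq):
    """Fragments m-n with 2 <= m <= n <= p-1 (m <= n is an assumption here)."""
    p = len(seq)
    return {(m, n): seq[m - 1:n] for m in range(2, p) for n in range(m, p)}


toy = "MKTLV"  # p = 5; placeholder sequence, not taken from any SEQ ID NO
n_del = n_terminal_deletions(toy)
c_del = c_terminal_deletions(toy)
both = double_deletions(toy)
```

For a polypeptide of length p, the first two helpers each return p-2 fragments, so the number of candidates grows linearly for single-terminus deletions and quadratically for double deletions.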

Also preferred are polypeptide and polynucleotide fragments characterized by
structural or functional domains, such as fragments that comprise alpha-helix and alpha-
helix forming regions, beta-sheet and beta-sheet-forming regions, turn and turn-
forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic
regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, surface-
forming regions, substrate binding region, and high antigenic index regions.
Polypeptide fragments of SEQ ID NO:Y falling within conserved domains are
specifically contemplated by the present invention. Moreover, polynucleotide
fragments encoding these domains are also contemplated.

Other preferred fragments are biologically active fragments. Biologically active
fragments are those exhibiting activity similar, but not necessarily identical, to an
activity of the polypeptide of the present invention. The biological activity of the
fragments may include an improved desired activity, or a decreased undesirable activity.

Epitopes & Antibodies 

In the present invention, "epitopes" refer to polypeptide fragments having
antigenic or immunogenic activity in an animal, especially in a human. A preferred
embodiment of the present invention relates to a polypeptide fragment comprising an
epitope, as well as the polynucleotide encoding this fragment. A region of a protein
molecule to which an antibody can bind is defined as an "antigenic epitope." In
contrast, an "immunogenic epitope" is defined as a part of a protein that elicits an
antibody response. (See, for instance, Geysen et al., Proc. Natl. Acad. Sci. USA
81:3998-4002 (1983).)

Fragments which function as epitopes may be produced by any conventional
means. (See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135
(1985), further described in U.S. Patent No. 4,631,211.)

In the present invention, antigenic epitopes preferably contain a sequence of at
least seven, more preferably at least nine, and most preferably between about 15 to
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including
monoclonal antibodies, that specifically bind the epitope. (See, for instance, Wilson et
al., Cell 37:767-778 (1984); Sutcliffe, J. G. et al., Science 219:660-666 (1983).)

Similarly, immunogenic epitopes can be used to induce antibodies according to
methods well known in the art. (See, for instance, Sutcliffe et al., supra; Wilson et al.,
supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et
al., J. Gen. Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes
the secreted protein. The immunogenic epitopes may be presented together with a
carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, if
it is long enough (at least about 25 amino acids), without a carrier. However,
immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to be
sufficient to raise antibodies capable of binding to, at the very least, linear epitopes in a
denatured polypeptide (e.g., in Western blotting).

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is
meant to include intact molecules as well as antibody fragments (such as, for example,
Fab and F(ab')2 fragments) which are capable of specifically binding to protein. Fab
and F(ab')2 fragments lack the Fc fragment of intact antibody, clear more rapidly from
the circulation, and may have less non-specific tissue binding than an intact antibody.
(Wahl et al., J. Nucl. Med. 24:316-325 (1983).) Thus, these fragments are preferred,
as well as the products of a FAB or other immunoglobulin expression library.
Moreover, antibodies of the present invention include chimeric, single chain, and
humanized antibodies.

Fusion Proteins 

Any polypeptide of the present invention can be used to generate fusion
proteins. For example, the polypeptide of the present invention, when fused to a
second protein, can be used as an antigenic tag. Antibodies raised against the
polypeptide of the present invention can be used to indirectly detect the second protein
by binding to the polypeptide. Moreover, because secreted proteins target cellular
locations based on trafficking signals, the polypeptides of the present invention can be
used as targeting molecules once fused to other proteins.

Examples of domains that can be fused to polypeptides of the present invention
include not only heterologous signal sequences, but also other heterologous functional
regions. The fusion does not necessarily need to be direct, but may occur through
linker sequences.

Moreover, fusion proteins may also be engineered to improve characteristics of
the polypeptide of the present invention. For instance, a region of additional amino
acids, particularly charged amino acids, may be added to the N-terminus of the
polypeptide to improve stability and persistence during purification from the host cell or
subsequent handling and storage. Also, peptide moieties may be added to the
polypeptide to facilitate purification. Such regions may be removed prior to final
preparation of the polypeptide. The addition of peptide moieties to facilitate handling of
polypeptides is a familiar and routine technique in the art.

Moreover, polypeptides of the present invention, including fragments, and
specifically epitopes, can be combined with parts of the constant domain of
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins
facilitate purification and show an increased half life in vivo. One reported example
describes chimeric proteins consisting of the first two domains of the human CD4-
polypeptide and various domains of the constant regions of the heavy or light chains of
mammalian immunoglobulins. (EP A 394,827; Traunecker et al., Nature 331:84-86
(1988).) Fusion proteins having disulfide-linked dimeric structures (due to the IgG)
can also be more efficient in binding and neutralizing other molecules than the
monomeric secreted protein or protein fragment alone. (Fountoulakis et al., J.
Biochem. 270:3958-3964 (1995).)

Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion
proteins comprising various portions of constant region of immunoglobulin molecules
together with another human protein or part thereof. In many cases, the Fc part in a
fusion protein is beneficial in therapy and diagnosis, and thus can result in, for
example, improved pharmacokinetic properties. (EP-A 0232 262.) Alternatively,
deleting the Fc part after the fusion protein has been expressed, detected, and purified,
would be desired. For example, the Fc portion may hinder therapy and diagnosis if the
fusion protein is used as an antigen for immunizations. In drug discovery, for
example, human proteins, such as hIL-5, have been fused with Fc portions for the
purpose of high-throughput screening assays to identify antagonists of hIL-5. (See, D.
Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., J. Biol.
Chem. 270:9459-9471 (1995).)

Moreover, the polypeptides of the present invention can be fused to marker
sequences, such as a peptide which facilitates purification of the fused polypeptide. In
preferred embodiments, the marker amino acid sequence is a hexa-histidine peptide,
such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue,
Chatsworth, CA, 91311), among others, many of which are commercially available.
As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for
instance, hexa-histidine provides for convenient purification of the fusion protein.
Another peptide tag useful for purification, the "HA" tag, corresponds to an epitope
derived from the influenza hemagglutinin protein. (Wilson et al., Cell 37:767 (1984).)

Thus, any of these above fusions can be engineered using the polynucleotides
or the polypeptides of the present invention.

Vectors, Host Cells, and Protein Production

The present invention also relates to vectors containing the polynucleotide of the
present invention, host cells, and the production of polypeptides by recombinant
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral
vector. Retroviral vectors may be replication competent or replication defective. In the
latter case, viral propagation generally will occur only in complementing host cells.

The polynucleotides may be joined to a vector containing a selectable marker for
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such
as a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is
a virus, it may be packaged in vitro using an appropriate packaging cell line and then
transduced into host cells.

The polynucleotide insert should be operatively linked to an appropriate
promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac
promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to
name a few. Other suitable promoters will be known to the skilled artisan. The
expression constructs will further contain sites for transcription initiation, termination,
and, in the transcribed region, a ribosome binding site for translation. The coding
portion of the transcripts expressed by the constructs will preferably include a
translation initiating codon at the beginning and a termination codon (UAA, UGA or
UAG) appropriately positioned at the end of the polypeptide to be translated.
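
The requirement that the coding portion begin with an initiating codon and end with a properly positioned termination codon can be checked mechanically before a construct is built. The Python sketch below is illustrative only: the function name is hypothetical, it assumes AUG as the initiating codon (the text does not name one), and it accepts either DNA or RNA letters.

```python
STOP_CODONS = {"UAA", "UGA", "UAG"}  # the three termination codons named in the text


def check_coding_region(sequence):
    """Return a list of problems found in a coding region (empty if none).

    Assumes AUG is the initiating codon; T is treated as U so DNA input works.
    """
    rna = sequence.upper().replace("T", "U")
    problems = []
    if len(rna) % 3 != 0:
        problems.append("length is not a multiple of three")
    if not rna.startswith("AUG"):
        problems.append("missing AUG initiation codon")
    # Split into complete in-frame codons, ignoring any trailing partial codon.
    codons = [rna[i:i + 3] for i in range(0, len(rna) - len(rna) % 3, 3)]
    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("missing terminal stop codon")
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        problems.append("premature in-frame stop codon")
    return problems


ok = check_coding_region("ATGAAATAA")    # start codon, one codon, stop codon
bad = check_coding_region("AAATAAGGG")   # no start, internal stop, no terminal stop
```

Returning a list of problems rather than a single boolean keeps the check useful when a construct fails for more than one reason.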

As indicated, the expression vectors will preferably include at least one
selectable marker. Such markers include dihydrofolate reductase, G418 or neomycin
resistance for eukaryotic cell culture and tetracycline, kanamycin or ampicillin resistance
genes for culturing in E. coli and other bacteria. Representative examples of
appropriate hosts include, but are not limited to, bacterial cells, such as E. coli,
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect
cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS,
293, and Bowes melanoma cells; and plant cells. Appropriate culture mediums and
conditions for the above-described host cells are known in the art.

Vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9,
available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A,
pNH16a, pNH18A, pNH46A, available from Stratagene Cloning Systems, Inc.; and
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia Biotech,
Inc. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXT1
and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available
from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan.

Introduction of the construct into the host cell can be effected by calcium
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated
transfection, electroporation, transduction, infection, or other methods. Such methods
are described in many standard laboratory manuals, such as Davis et al., Basic Methods
In Molecular Biology (1986). It is specifically contemplated that the polypeptides of the
present invention may in fact be expressed by a host cell lacking a recombinant vector.

A polypeptide of this invention can be recovered and purified from recombinant
cell cultures by well-known methods including ammonium sulfate or ethanol
precipitation, acid extraction, anion or cation exchange chromatography,
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity
chromatography, hydroxylapatite chromatography and lectin chromatography. Most
preferably, high performance liquid chromatography ("HPLC") is employed for
purification.

Polypeptides of the present invention, and preferably the secreted form, can also
be recovered from: products purified from natural sources, including bodily fluids,
tissues and cells, whether directly isolated or cultured; products of chemical synthetic
procedures; and products produced by recombinant techniques from a prokaryotic or
eukaryotic host, including, for example, bacterial, yeast, higher plant, insect, and
mammalian cells. Depending upon the host employed in a recombinant production
procedure, the polypeptides of the present invention may be glycosylated or may be
non-glycosylated. In addition, polypeptides of the invention may also include an initial
modified methionine residue, in some cases as a result of host-mediated processes.
Thus, it is well known in the art that the N-terminal methionine encoded by the
translation initiation codon generally is removed with high efficiency from any protein
after translation in all eukaryotic cells. While the N-terminal methionine on most
proteins also is efficiently removed in most prokaryotes, for some proteins, this
prokaryotic removal process is inefficient, depending on the nature of the amino acid to
which the N-terminal methionine is covalently linked.

Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as
reagents. The following description should be considered exemplary and utilizes
known techniques.

The polynucleotides of the present invention are useful for chromosome
identification. There exists an ongoing need to identify new chromosome markers,
since few chromosome marking reagents, based on actual sequence data (repeat
polymorphisms), are presently available. Each polynucleotide of the present invention
can be used as a chromosome marker.

Briefly, sequences can be mapped to chromosomes by preparing PCR primers
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be
selected using computer analysis so that primers do not span more than one predicted
exon in the genomic DNA. These primers are then used for PCR screening of somatic
cell hybrids containing individual human chromosomes. Only those hybrids containing
the human gene corresponding to the SEQ ID NO:X will yield an amplified fragment.

Similarly, somatic hybrids provide a rapid method of PCR mapping the
polynucleotides to particular chromosomes. Three or more clones can be assigned per
day using a single thermal cycler. Moreover, sublocalization of the polynucleotides can
be achieved with panels of specific chromosome fragments. Other gene mapping
strategies that can be used include in situ hybridization, prescreening with labeled
flow-sorted chromosomes, and preselection by hybridization to construct chromosome
specific cDNA libraries.
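
The computer-assisted primer selection described above can be illustrated with a
minimal Python sketch. It is not the procedure used by the inventors; the function
names, the exon coordinate table, and the rule of placing both primers at the two ends
of a single predicted exon are assumptions made only for illustration. The sketch
enforces the 15-25 base primer length and keeps each primer inside one exon.

```python
# Illustrative sketch only: function names and the exon table are hypothetical.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def pick_primer_pair(cdna, exons, primer_len=20):
    """Pick a forward/reverse primer pair lying inside a single predicted exon.

    exons is a list of half-open (start, end) coordinates on the cDNA.
    Returns (forward, reverse, product_length), or None if no exon can hold
    two non-overlapping primers of the requested length.
    """
    if not 15 <= primer_len <= 25:
        raise ValueError("primer length should be 15-25 bases")
    for start, end in exons:
        if end - start >= 2 * primer_len:
            # Forward primer: first bases of the exon, read 5'->3'.
            forward = cdna[start:start + primer_len].upper()
            # Reverse primer: reverse complement of the last bases of the exon.
            reverse = reverse_complement(cdna[end - primer_len:end])
            return forward, reverse, end - start
    return None


if __name__ == "__main__":
    toy_cdna = "ATGC" * 20                  # 80-base toy sequence
    toy_exons = [(0, 30), (30, 80)]         # assumed exon predictions
    print(pick_primer_pair(toy_cdna, toy_exons))
```

A real design would also check melting temperature, GC content, and uniqueness in the
genome; those checks are omitted here.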

Precise chromosomal location of the polynucleotides can also be achieved using
fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. This
technique uses polynucleotides as short as 500 or 600 bases; however, polynucleotides
2,000-4,000 bp are preferred. For a review of this technique, see Verma et al.,
"Human Chromosomes: a Manual of Basic Techniques," Pergamon Press, New York
(1988).

For chromosome mapping, the polynucleotides can be used individually (to
mark a single chromosome or a single site on that chromosome) or in panels (for
marking multiple sites and/or multiple chromosomes). Preferred polynucleotides
correspond to the noncoding regions of the cDNAs because the coding sequences are
more likely conserved within gene families, thus increasing the chance of cross
hybridization during chromosomal mapping.

Once a polynucleotide has been mapped to a precise chromosomal location, the
physical position of the polynucleotide can be used in linkage analysis. Linkage
analysis establishes coinheritance between a chromosomal location and presentation of a
particular disease. (Disease mapping data are found, for example, in V. McKusick,
Mendelian Inheritance in Man (available on line through Johns Hopkins University
Welch Medical Library).) Assuming 1 megabase mapping resolution and one gene per
20 kb, a cDNA precisely localized to a chromosomal region associated with the disease
could be one of 50-500 potential causative genes.
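
The arithmetic behind this estimate can be checked with a short Python sketch. The
function name is hypothetical. The text states only the 1 megabase resolution and the
density of one gene per 20 kb; reading the upper figure of 500 as a linked region of
about 10 megabases at the same density is an assumption, not something the text says.

```python
def candidate_genes(region_bp: int, bp_per_gene: int = 20_000) -> int:
    """Number of genes expected in a region at a uniform gene density."""
    return region_bp // bp_per_gene


# 1 megabase at one gene per 20 kb gives the lower figure in the text.
print(candidate_genes(1_000_000))    # 50
# Assumed 10 megabase region at the same density gives the upper figure.
print(candidate_genes(10_000_000))   # 500
```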

Thus, once coinheritance is established, differences in the polynucleotide and
the corresponding gene between affected and unaffected individuals can be examined.
First, visible structural alterations in the chromosomes, such as deletions or
translocations, are examined in chromosome spreads or by PCR. If no structural
alterations exist, the presence of point mutations is ascertained. Mutations observed in
some or all affected individuals, but not in normal individuals, indicate that the
mutation may cause the disease. However, complete sequencing of the polypeptide and
the corresponding gene from several normal individuals is required to distinguish the
mutation from a polymorphism. If a new polymorphism is identified, this polymorphic
polypeptide can be used for further linkage analysis.

Furthermore, increased or decreased expression of the gene in affected
individuals as compared to unaffected individuals can be assessed using
polynucleotides of the present invention. Any of these alterations (altered expression,
chromosomal rearrangement, or mutation) can be used as a diagnostic or prognostic
marker.

In addition to the foregoing, a polynucleotide can be used to control gene
expression through triple helix formation or antisense DNA or RNA. Both methods
rely on binding of the polynucleotide to DNA or RNA. For these techniques, preferred
polynucleotides are usually 20 to 40 bases in length and complementary to either the
region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids
Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et al., Science
251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC
Press, Boca Raton, FL (1988)). Triple helix formation optimally results in a shut-off
of RNA transcription from DNA, while antisense RNA hybridization blocks translation
of an mRNA molecule into polypeptide. Both techniques are effective in model
systems, and the information disclosed herein can be used to design antisense or triple
helix polynucleotides in an effort to treat disease.
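
As a minimal illustration of the antisense case, the following Python sketch derives a
DNA oligonucleotide complementary to a chosen window of an mRNA. The function name,
the toy mRNA, and the choice of window are hypothetical; only the 20 to 40 base length
range comes from the text. Selecting an effective target site in practice involves
further considerations (accessibility, specificity) that are not modeled here.

```python
# Illustrative sketch only: names and the toy sequence are hypothetical.

DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(mrna: str, start: int, length: int = 20) -> str:
    """Return a DNA oligo, written 5'->3', complementary to mrna[start:start+length]."""
    if not 20 <= length <= 40:
        raise ValueError("preferred antisense length is 20 to 40 bases")
    # Work in the DNA alphabet: treat U in the mRNA as T.
    target = mrna[start:start + length].upper().replace("U", "T")
    if len(target) < length:
        raise ValueError("window runs past the end of the mRNA")
    # The antisense strand is the reverse complement of the target window.
    return target.translate(DNA_COMPLEMENT)[::-1]


if __name__ == "__main__":
    toy_mrna = "AUGGCUAAGCUUGGAUCCGA"
    print(antisense_oligo(toy_mrna, 0, 20))
```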

Polynucleotides of the present invention are also useful in gene therapy. One
goal of gene therapy is to insert a normal gene into an organism having a defective
gene, in an effort to correct the genetic defect. The polynucleotides disclosed in the
present invention offer a means of targeting such genetic defects in a highly accurate
manner. Another goal is to insert a new gene that was not present in the host genome,
thereby producing a new trait in the host cell.

The polynucleotides are also useful for identifying individuals from minute
biological samples. The United States military, for example, is considering the use of
restriction fragment length polymorphism (RFLP) for identification of its personnel. In
this technique, an individual's genomic DNA is digested with one or more restriction
enzymes, and probed on a Southern blot to yield unique bands for identifying
personnel. This method does not suffer from the current limitations of "Dog Tags"
which can be lost, switched, or stolen, making positive identification difficult. The
polynucleotides of the present invention can be used as additional DNA markers for
RFLP.
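
The principle behind RFLP can be sketched in Python as an in-silico digest: a single
base difference that destroys a restriction site changes the set of fragment lengths.
The function name and the toy sequences are hypothetical; the EcoRI recognition site
GAATTC, cut after the first G, is used only as a familiar example and is not named in
the text. A real RFLP assay additionally depends on which fragments the labeled probe
hybridizes to on the Southern blot.

```python
# Illustrative sketch only: names and toy sequences are hypothetical.

def digest(dna, site="GAATTC", cut_offset=1):
    """Return fragment lengths after cutting a linear DNA at every site.

    cut_offset is the position of the cut within the recognition site
    (1 for EcoRI, which cuts between G and AATTC).
    """
    dna = dna.upper()
    cuts = []
    pos = dna.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = dna.find(site, pos + 1)
    bounds = [0] + cuts + [len(dna)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    person_a = "AAAA" + "GAATTC" + "CCCCCCCC" + "GAATTC" + "TT"
    person_b = "AAAA" + "GAATTC" + "CCCCCCCC" + "GAGTTC" + "TT"  # one site lost
    print(digest(person_a))   # [5, 14, 7]
    print(digest(person_b))   # [5, 21]
```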

The polynucleotides of the present invention can also be used as an alternative to
RFLP, by determining the actual base-by-base DNA sequence of selected portions of an
individual's genome. These sequences can be used to prepare PCR primers for
amplifying and isolating such selected DNA, which can then be sequenced. Using this
technique, individuals can be identified because each individual will have a unique set
of DNA sequences. Once a unique ID database is established for an individual,
positive identification of that individual, living or dead, can be made from extremely
small tissue samples.

Forensic biology also benefits from using DNA-based identification techniques
as disclosed herein. DNA sequences taken from very small biological samples such as
tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, etc., can be
amplified using PCR. In one prior art technique, gene sequences amplified from
polymorphic loci, such as DQa class II HLA gene, are used in forensic biology to
identify individuals. (Erlich, H., PCR Technology, Freeman and Co. (1992).) Once
these specific polymorphic loci are amplified, they are digested with one or more
restriction enzymes, yielding an identifying set of bands on a Southern blot probed with
DNA corresponding to the DQa class II HLA gene. Similarly, polynucleotides of the
present invention can be used as polymorphic markers for forensic purposes.

There is also a need for reagents capable of identifying the source of a particular
tissue. Such need arises, for example, in forensics when presented with tissue of
unknown origin. Appropriate reagents can comprise, for example, DNA probes or
primers specific to particular tissue prepared from the sequences of the present
invention. Panels of such reagents can identify tissue by species and/or by organ type.
In a similar fashion, these reagents can be used to screen tissue cultures for
contamination.

At the very least, the polynucleotides of the present invention can be used as
molecular weight markers on Southern gels, as diagnostic probes for the presence of a
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences
in the process of discovering novel polynucleotides, for selecting and making oligomers
for attachment to a "gene chip" or other support, to raise anti-DNA antibodies using
DNA immunization techniques, and as an antigen to elicit an immune response.

Uses of the Polypeptides 

Each of the polypeptides identified herein can be used in numerous ways. The
following description should be considered exemplary and utilizes known techniques.

A polypeptide of the present invention can be used to assay protein levels in a
biological sample using antibody-based techniques. For example, protein expression in
tissues can be studied with classical immunohistological methods. (Jalkanen, M., et
al., J. Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell. Biol. 105:3087-3096
(1987).) Other antibody-based methods useful for detecting protein gene
expression include immunoassays, such as the enzyme linked immunosorbent assay
(ELISA) and the radioimmunoassay (RIA). Suitable antibody assay labels are known
in the art and include enzyme labels, such as glucose oxidase, and radioisotopes, such
as iodine (125I, 121I), carbon (14C), sulfur (35S), tritium (3H), indium (112In), and
technetium (99mTc), and fluorescent labels, such as fluorescein and rhodamine, and
biotin.

In addition to assaying secreted protein levels in a biological sample, proteins
can also be detected in vivo by imaging. Antibody labels or markers for in vivo
imaging of protein include those detectable by X-radiography, NMR or ESR. For
X-radiography, suitable labels include radioisotopes such as barium or cesium, which emit
detectable radiation but are not overtly harmful to the subject. Suitable markers for
NMR and ESR include those with a detectable characteristic spin, such as deuterium,
which may be incorporated into the antibody by labeling of nutrients for the relevant
hybridoma.

A protein-specific antibody or antibody fragment which has been labeled with
an appropriate detectable imaging moiety, such as a radioisotope (for example, 131I,
112In, 99mTc), a radio-opaque substance, or a material detectable by nuclear magnetic
resonance, is introduced (for example, parenterally, subcutaneously, or
intraperitoneally) into the mammal. It will be understood in the art that the size of the
subject and the imaging system used will determine the quantity of imaging moiety
needed to produce diagnostic images. In the case of a radioisotope moiety, for a human
subject, the quantity of radioactivity injected will normally range from about 5 to 20
millicuries of 99mTc. The labeled antibody or antibody fragment will then
preferentially accumulate at the location of cells which contain the specific protein. In
vivo tumor imaging is described in S.W. Burchiel et al., "Immunopharmacokinetics of
Radiolabeled Antibodies and Their Fragments." (Chapter 13 in Tumor Imaging: The
Radiochemical Detection of Cancer, S.W. Burchiel and B. A. Rhodes, eds., Masson
Publishing Inc. (1982).)

Thus, the invention provides a method of diagnosing a disorder, which involves
(a) assaying the expression of a polypeptide of the present invention in cells or body
fluid of an individual; and (b) comparing the level of gene expression with a standard gene
expression level, whereby an increase or decrease in the assayed polypeptide gene
expression level compared to the standard expression level is indicative of a disorder.
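
Step (b) of this method amounts to a simple comparison, sketched below in Python. The
function name, the units, and in particular the 20% tolerance band are assumptions
introduced for illustration; the text does not specify how large an increase or
decrease must be before it is considered indicative of a disorder.

```python
# Illustrative sketch only: the tolerance band is an assumed parameter.

def compare_to_standard(assayed, standard, tolerance=0.2):
    """Classify an assayed expression level relative to a standard level.

    Both levels must be in the same (arbitrary) units. tolerance is the
    fractional deviation from the standard treated as normal variation.
    """
    if standard <= 0:
        raise ValueError("standard expression level must be positive")
    ratio = assayed / standard
    if ratio > 1 + tolerance:
        return "increased"
    if ratio < 1 - tolerance:
        return "decreased"
    return "within standard range"


if __name__ == "__main__":
    print(compare_to_standard(150.0, 100.0))  # increased
    print(compare_to_standard(70.0, 100.0))   # decreased
    print(compare_to_standard(105.0, 100.0))  # within standard range
```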

Moreover, polypeptides of the present invention can be used to treat disease.
For example, patients can be administered a polypeptide of the present invention in an
effort to replace absent or decreased levels of the polypeptide (e.g., insulin), to
supplement absent or decreased levels of a different polypeptide (e.g., hemoglobin S
for hemoglobin B), to inhibit the activity of a polypeptide (e.g., an oncogene), to
activate a polypeptide (e.g., by binding to a receptor), to reduce the
activity of a membrane bound receptor by competing with it for free ligand (e.g.,
soluble TNF receptors used in reducing inflammation), or to bring about a desired
response (e.g., blood vessel growth).

Similarly, antibodies directed to a polypeptide of the present invention can also
be used to treat disease. For example, administration of an antibody directed to a
polypeptide of the present invention can bind and reduce overproduction of the
polypeptide. Similarly, administration of an antibody can activate the polypeptide, such
as by binding to a polypeptide bound to a membrane (receptor).

At the very least, the polypeptides of the present invention can be used as
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration
columns using methods well known to those of skill in the art. Polypeptides can also
be used to raise antibodies, which in turn are used to measure protein expression from a
recombinant cell, as a way of assessing transformation of the host cell. Moreover, the
polypeptides of the present invention can be used to test the following biological
activities.

Biological Activities

The polynucleotides and polypeptides of the present invention can be used in
assays to test for one or more biological activities. If these polynucleotides and
polypeptides do exhibit activity in a particular assay, it is likely that these molecules
may be involved in the diseases associated with the biological activity. Thus, the
polynucleotides and polypeptides could be used to treat the associated disease.

Immune Activity 

A polypeptide or polynucleotide of the present invention may be useful in
treating deficiencies or disorders of the immune system, by activating or inhibiting the
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune
cells develop through a process called hematopoiesis, producing myeloid (platelets, red
blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) cells
from pluripotent stem cells. The etiology of these immune deficiencies or disorders
may be genetic, somatic, such as cancer or some autoimmune disorders, acquired (e.g.,
by chemotherapy or toxins), or infectious. Moreover, a polynucleotide or polypeptide
of the present invention can be used as a marker or detector of a particular immune
system disease or disorder.

A polynucleotide or polypeptide of the present invention may be useful in
treating or detecting deficiencies or disorders of hematopoietic cells. A polypeptide or
polynucleotide of the present invention could be used to increase differentiation and
proliferation of hematopoietic cells, including the pluripotent stem cells, in an effort to
treat those disorders associated with a decrease in certain (or many) types of hematopoietic
cells. Examples of immunologic deficiency syndromes include, but are not limited to:
blood protein disorders (e.g., agammaglobulinemia, dysgammaglobulinemia), ataxia
telangiectasia, common variable immunodeficiency, DiGeorge Syndrome, HIV
infection, HTLV-BLV infection, leukocyte adhesion deficiency syndrome,
lymphopenia, phagocyte bactericidal dysfunction, severe combined immunodeficiency
(SCIDs), Wiskott-Aldrich Disorder, anemia, thrombocytopenia, or hemoglobinuria.

Moreover, a polypeptide or polynucleotide of the present invention could also
be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity (clot
formation). For example, by increasing hemostatic or thrombolytic activity, a
polynucleotide or polypeptide of the present invention could be used to treat blood
coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet
disorders (e.g., thrombocytopenia), or wounds resulting from trauma, surgery, or other
causes. Alternatively, a polynucleotide or polypeptide of the present invention that can
decrease hemostatic or thrombolytic activity could be used to inhibit or dissolve
clotting. These molecules could be important in the treatment of heart attacks
(infarction), strokes, or scarring.

A polynucleotide or polypeptide of the present invention may also be useful in
treating or detecting autoimmune disorders. Many autoimmune disorders result from
inappropriate recognition of self as foreign material by immune cells. This
inappropriate recognition results in an immune response leading to the destruction of the
host tissue. Therefore, the administration of a polypeptide or polynucleotide of the
present invention that inhibits an immune response, particularly the proliferation,
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing
autoimmune disorders.

Examples of autoimmune disorders that can be treated or detected by the present
invention include, but are not limited to: Addison's Disease, hemolytic anemia,
antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic encephalomyelitis,
glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, Multiple Sclerosis,
Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, Pemphigus,
Polyendocrinopathies, Purpura, Reiter's Disease, Stiff-Man Syndrome, Autoimmune
Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary Inflammation,
Guillain-Barre Syndrome, insulin dependent diabetes mellitus, and autoimmune
inflammatory eye disease.

Similarly, allergic reactions and conditions, such as asthma (particularly allergic
asthma) or other respiratory problems, may also be treated by a polypeptide or
polynucleotide of the present invention. Moreover, these molecules can be used to treat
anaphylaxis, hypersensitivity to an antigenic molecule, or blood group incompatibility.

A polynucleotide or polypeptide of the present invention may also be used to
treat and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ
rejection occurs by host immune cell destruction of the transplanted tissue through an
immune response. Similarly, an immune response is also involved in GVHD, but, in
this case, the foreign transplanted immune cells destroy the host tissues. The
administration of a polypeptide or polynucleotide of the present invention that inhibits
an immune response, particularly the proliferation, differentiation, or chemotaxis of
T-cells, may be an effective therapy in preventing organ rejection or GVHD.

Similarly, a polypeptide or polynucleotide of the present invention may also be
used to modulate inflammation. For example, the polypeptide or polynucleotide may
inhibit the proliferation and differentiation of cells involved in an inflammatory
response. These molecules can be used to treat inflammatory conditions, both chronic
and acute conditions, including inflammation associated with infection (e.g., septic
shock, sepsis, or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion
injury, endotoxin lethality, arthritis, complement-mediated hyperacute
rejection, nephritis, cytokine or chemokine induced lung injury, inflammatory bowel
disease, Crohn's disease, or resulting from over production of cytokines (e.g., TNF or
IL-1).

Hyperproliferative Disorders

A polypeptide or polynucleotide can be used to treat or detect hyperproliferative
disorders, including neoplasms. A polypeptide or polynucleotide of the present
invention may inhibit the proliferation of the disorder through direct or indirect
interactions. Alternatively, a polypeptide or polynucleotide of the present invention
may proliferate other cells which can inhibit the hyperproliferative disorder.

For example, by increasing an immune response, particularly increasing
antigenic qualities of the hyperproliferative disorder or by proliferating, differentiating,
or mobilizing T-cells, hyperproliferative disorders can be treated. This immune
response may be increased by either enhancing an existing immune response, or by
initiating a new immune response. Alternatively, decreasing an immune response may
also be a method of treating hyperproliferative disorders, such as with a chemotherapeutic
agent.

Examples of hyperproliferative disorders that can be treated or detected by a
polynucleotide or polypeptide of the present invention include, but are not limited to
neoplasms located in the: abdomen, bone, breast, digestive system, liver, pancreas,
peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus,
thyroid), eye, head and neck, nervous (central and peripheral), lymphatic system,
pelvic, skin, soft tissue, spleen, thoracic, and urogenital.

Similarly, other hyperproliferative disorders can also be treated or detected by a
polynucleotide or polypeptide of the present invention. Examples of such
hyperproliferative disorders include, but are not limited to: hypergammaglobulinemia,
lymphoproliferative disorders, paraproteinemias, purpura, sarcoidosis, Sezary
Syndrome, Waldenstrom's Macroglobulinemia, Gaucher's Disease, histiocytosis, and
any other hyperproliferative disease, besides neoplasia, located in an organ system
listed above.

Infectious Disease 

A polypeptide or polynucleotide of the present invention can be used to treat or
detect infectious agents. For example, by increasing the immune response, particularly
increasing the proliferation and differentiation of B and/or T cells, infectious diseases
may be treated. The immune response may be increased by either enhancing an existing
immune response, or by initiating a new immune response. Alternatively, the
polypeptide or polynucleotide of the present invention may also directly inhibit the
infectious agent, without necessarily eliciting an immune response.

Viruses are one example of an infectious agent that can cause disease or
symptoms that can be treated or detected by a polynucleotide or polypeptide of the
present invention. Examples of viruses include, but are not limited to, the following
DNA and RNA viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus,
Birnaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Flaviviridae,
Hepadnaviridae (Hepatitis), Herpesviridae (such as Cytomegalovirus, Herpes
Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, Morbillivirus,
Rhabdoviridae), Orthomyxoviridae (e.g., Influenza), Papovaviridae, Parvoviridae,
Picornaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g.,
Rotavirus), Retroviridae (HTLV-I, HTLV-II, Lentivirus), and Togaviridae (e.g.,
Rubivirus). Viruses falling within these families can cause a variety of diseases or
symptoms, including, but not limited to: arthritis, bronchiolitis, encephalitis, eye
infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C,
E, Chronic Active, Delta), meningitis, opportunistic infections (e.g., AIDS),
pneumonia, Burkitt's Lymphoma, chickenpox, hemorrhagic fever, Measles, Mumps,
Parainfluenza, Rabies, the common cold, Polio, leukemia, Rubella, sexually
transmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. A polypeptide
or polynucleotide of the present invention can be used to treat or detect any of these
symptoms or diseases.

Similarly, bacterial or fungal agents that can cause disease or symptoms and that
can be treated or detected by a polynucleotide or polypeptide of the present invention
include, but are not limited to, the following Gram-negative and Gram-positive bacterial
families and fungi: Actinomycetales (e.g., Corynebacterium, Mycobacterium,
Nocardia), Aspergillosis, Bacillaceae (e.g., Anthrax, Clostridium), Bacteroidaceae,
Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis, Campylobacter,
Coccidioidomycosis, Cryptococcosis, Dermatomycoses, Enterobacteriaceae (Klebsiella,
Salmonella, Serratia, Yersinia), Erysipelothrix, Helicobacter, Legionellosis,
Leptospirosis, Listeria, Mycoplasmatales, Neisseriaceae (e.g., Acinetobacter,
Gonorrhea, Meningococcal), Pasteurellaceae Infections (e.g., Actinobacillus,
Haemophilus, Pasteurella), Pseudomonas, Rickettsiaceae, Chlamydiaceae, Syphilis,
and Staphylococcal. These bacterial or fungal families can cause the following diseases
or symptoms, including, but not limited to: bacteremia, endocarditis, eye infections
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., AIDS
related infections), paronychia, prosthesis-related infections, Reiter's Disease,
respiratory tract infections, such as Whooping Cough or Empyema, sepsis, Lyme
Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, food poisoning,
Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis, Diphtheria,
Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, tetanus,
impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin diseases
(e.g., cellulitis, dermatomycoses), toxemia, urinary tract infections, and wound infections.
A polypeptide or polynucleotide of the present invention can be used to treat or detect
any of these symptoms or diseases.

Moreover, parasitic agents causing disease or symptoms that can be treated or
detected by a polynucleotide or polypeptide of the present invention include, but are not
limited to, the following families: Amebiasis, Babesiosis, Coccidiosis,
Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, Helminthiasis,
Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis, and Trichomonas.
These parasites can cause a variety of diseases or symptoms, including, but not limited
to: Scabies, Trombiculiasis, eye infections, intestinal disease (e.g., dysentery,
giardiasis), liver disease, lung disease, opportunistic infections (e.g., AIDS related),
Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or polynucleotide
of the present invention can be used to treat or detect any of these symptoms or
diseases.

Preferably, treatment using a polypeptide or polynucleotide of the present
invention could either be by administering an effective amount of a polypeptide to the
patient, or by removing cells from the patient, supplying the cells with a polynucleotide
of the present invention, and returning the engineered cells to the patient (ex vivo
therapy). Moreover, the polypeptide or polynucleotide of the present invention can be
used as an antigen in a vaccine to raise an immune response against infectious disease.

Regeneration 

A polynucleotide or polypeptide of the present invention can be used to
differentiate, proliferate, and attract cells, leading to the regeneration of tissues. (See,
Science 276:59-87 (1997).) The regeneration of tissues could be used to repair,
replace, or protect tissue damaged by congenital defects, trauma (wounds, burns,
incisions, or ulcers), age, disease (e.g., osteoporosis, osteoarthritis, periodontal
disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, reperfusion
injury, or systemic cytokine damage.

Tissues that could be regenerated using the present invention include organs
(e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal
or cardiac), vascular (including vascular endothelium), nervous, hematopoietic, and
skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration occurs
without scarring or with decreased scarring. Regeneration also may include angiogenesis.

Moreover, a polynucleotide or polypeptide of the present invention may increase
regeneration of tissues difficult to heal. For example, increased tendon/ligament
regeneration would quicken recovery time after damage. A polynucleotide or
polypeptide of the present invention could also be used prophylactically in an effort to
avoid damage. Specific diseases that could be treated include tendinitis, carpal tunnel
syndrome, and other tendon or ligament defects. Further examples of non-healing
wounds whose tissue could be regenerated include pressure ulcers, ulcers associated with
vascular insufficiency, and surgical and traumatic wounds.

Similarly, nerve and brain tissue could also be regenerated by using a
polynucleotide or polypeptide of the present invention to proliferate and differentiate
nerve cells. Diseases that could be treated using this method include central and
peripheral nervous system diseases, neuropathies, or mechanical and traumatic
disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and
stroke). Specifically, diseases associated with peripheral nerve injuries, peripheral
neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease,
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and
Shy-Drager syndrome), could all be treated using the polynucleotide or polypeptide of the
present invention.

Chemotaxis 

A polynucleotide or polypeptide of the present invention may have chemotaxis
activity. A chemotaxic molecule attracts or mobilizes cells (e.g., monocytes,
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial
cells) to a particular site in the body, such as inflammation, infection, or site of
hyperproliferation. The mobilized cells can then fight off and/or heal the particular
trauma or abnormality.

A polynucleotide or polypeptide of the present invention may increase
chemotaxic activity of particular cells. These chemotactic molecules can then be used to
treat inflammation, infection, hyperproliferative disorders, or any immune system
disorder by increasing the number of cells targeted to a particular location in the body.
For example, chemotaxic molecules can be used to treat wounds and other trauma to
tissues by attracting immune cells to the injured location. Chemotactic molecules of the
present invention can also attract fibroblasts, which can be used to treat wounds.

It is also contemplated that a polynucleotide or polypeptide of the present
invention may inhibit chemotactic activity. These molecules could also be used to treat
disorders. Thus, a polynucleotide or polypeptide of the present invention could be used
as an inhibitor of chemotaxis.

Binding Activity 

A polypeptide of the present invention may be used to screen for molecules that
bind to the polypeptide or for molecules to which the polypeptide binds. The binding
of the polypeptide and the molecule may activate (agonist), increase, inhibit
(antagonist), or decrease activity of the polypeptide or the molecule bound. Examples
of such molecules include antibodies, oligonucleotides, proteins (e.g., receptors), or
small molecules.

Preferably, the molecule is closely related to the natural ligand of the
polypeptide, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural
or functional mimetic. (See, Coligan et al., Current Protocols in Immunology
1(2):Chapter 5 (1991).) Similarly, the molecule can be closely related to the natural
receptor to which the polypeptide binds, or at least, a fragment of the receptor capable
of being bound by the polypeptide (e.g., active site). In either case, the molecule can
be rationally designed using known techniques.

Preferably, the screening for these molecules involves producing appropriate
cells which express the polypeptide, either as a secreted protein or on the cell
membrane. Preferred cells include cells from mammals, yeast, Drosophila, or E. coli.
Cells expressing the polypeptide (or cell membrane containing the expressed
polypeptide) are then preferably contacted with a test compound potentially containing
the molecule to observe binding, stimulation, or inhibition of activity of either the
polypeptide or the molecule.

The assay may simply test binding of a candidate compound to the polypeptide,
wherein binding is detected by a label, or in an assay involving competition with a
labeled competitor. Further, the assay may test whether the candidate compound results
in a signal generated by binding to the polypeptide.

Alternatively, the assay can be carried out using cell-free preparations,
polypeptide/molecule affixed to a solid support, chemical libraries, or natural product
mixtures. The assay may also simply comprise the steps of mixing a candidate
compound with a solution containing a polypeptide, measuring polypeptide/molecule
activity or binding, and comparing the polypeptide/molecule activity or binding to a
standard.

Preferably, an ELISA assay can measure polypeptide level or activity in a
sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The
antibody can measure polypeptide level or activity by either binding, directly or
indirectly, to the polypeptide or by competing with the polypeptide for a substrate.

All of these above assays can be used as diagnostic or prognostic markers. The
molecules discovered using these assays can be used to treat disease or to bring about a
particular result in a patient (e.g., blood vessel growth) by activating or inhibiting the
polypeptide/molecule. Moreover, the assays can discover agents which may inhibit or
enhance the production of the polypeptide from suitably manipulated cells or tissues.

Therefore, the invention includes a method of identifying compounds which
bind to a polypeptide of the invention comprising the steps of: (a) incubating a
candidate binding compound with a polypeptide of the invention; and (b) determining if
binding has occurred. Moreover, the invention includes a method of identifying
agonists/antagonists comprising the steps of: (a) incubating a candidate compound with
a polypeptide of the invention, (b) assaying a biological activity, and (c) determining if
a biological activity of the polypeptide has been altered.
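
The comparison step in the agonist/antagonist method above (measure an activity, then compare it to a standard) can be illustrated with a minimal Python sketch. The function name, the 10% tolerance, and the output labels are assumptions made for illustration only; the specification does not prescribe any particular threshold or scoring scheme.

```python
def classify_candidate(activity_with_compound, standard_activity, tolerance=0.10):
    """Compare a measured activity to a standard (hypothetical helper).

    `tolerance` is an assumed fractional band around the standard within
    which the activity is treated as unchanged.
    """
    if standard_activity <= 0:
        raise ValueError("standard activity must be positive")
    ratio = activity_with_compound / standard_activity
    if ratio > 1 + tolerance:
        return "agonist-like (activity increased)"
    if ratio < 1 - tolerance:
        return "antagonist-like (activity decreased)"
    return "no change detected"


if __name__ == "__main__":
    # Illustrative numbers only, in arbitrary activity units.
    for measured in (150.0, 100.0, 40.0):
        print(measured, "->", classify_candidate(measured, 100.0))
```

In practice the tolerance would be set from replicate measurements of the standard rather than fixed in advance.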

Other Activities 

A polypeptide or polynucleotide of the present invention may also increase or
decrease the differentiation or proliferation of embryonic stem cells, besides, as
discussed above, hematopoietic lineage.

A polypeptide or polynucleotide of the present invention may also be used to
modulate mammalian characteristics, such as body height, weight, hair color, eye color,
skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic
surgery). Similarly, a polypeptide or polynucleotide of the present invention may be
used to modulate mammalian metabolism affecting catabolism, anabolism, processing,
utilization, and storage of energy.

A polypeptide or polynucleotide of the present invention may be used to change
a mammal's mental state or physical state by influencing biorhythms, circadian
rhythms, depression (including depressive disorders), tendency for violence, tolerance
for pain, reproductive capabilities (preferably by Activin or Inhibin-like activity),
hormonal or endocrine levels, appetite, libido, memory, stress, or other cognitive
qualities.

A polypeptide or polynucleotide of the present invention may also be used as a
food additive or preservative, such as to increase or decrease storage capabilities, fat
content, lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional
components.

Other Preferred Embodiments

Other preferred embodiments of the claimed invention include an isolated
nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical
to a sequence of at least about 50 contiguous nucleotides in the nucleotide sequence of
SEQ ID NO:X, wherein X is any integer as defined in Table 1.
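
As a rough illustration of what "at least 95% identical to a sequence of at least about 50 contiguous nucleotides" means arithmetically, the following Python sketch slides a query along a reference sequence and reports the best ungapped percent identity. This is a simplification assumed for illustration: it ignores gaps and is not the alignment procedure used elsewhere in this specification to determine identity. The function names and the toy sequences are hypothetical.

```python
def best_window_identity(query, reference, min_len=50):
    """Highest ungapped percent identity between `query` and any
    equal-length contiguous window of `reference`.

    Returns 0.0 if the query is shorter than `min_len` or longer than
    the reference. Gaps are not considered (assumed simplification).
    """
    query, reference = query.upper(), reference.upper()
    n = len(query)
    if n < min_len or n > len(reference):
        return 0.0
    best = 0.0
    for start in range(len(reference) - n + 1):
        window = reference[start:start + n]
        matches = sum(q == r for q, r in zip(query, window))
        best = max(best, 100.0 * matches / n)
    return best


def meets_threshold(query, reference, min_len=50, min_identity=95.0):
    """True if the query reaches `min_identity` percent over some window."""
    return best_window_identity(query, reference, min_len) >= min_identity


if __name__ == "__main__":
    reference = "ACGT" * 25            # toy 100-nucleotide stand-in for SEQ ID NO:X
    query = reference[10:60]           # a 50-nucleotide contiguous stretch
    print(best_window_identity(query, reference))
    print(meets_threshold(query, reference))
```

Under this simplification, a 50-nucleotide query may differ from the reference window at no more than 2 positions to remain at or above 95% identity.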

Also preferred is a nucleic acid molecule wherein said sequence of contiguous
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the
Clone Sequence and ending with the nucleotide at about the position of the 3'
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1.

Also preferred is a nucleic acid molecule wherein said sequence of contiguous
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the
Start Codon and ending with the nucleotide at about the position of the 3' Nucleotide of
the Clone Sequence as defined for SEQ ID NO:X in Table 1.

Similarly preferred is a nucleic acid molecule wherein said sequence of
contiguous nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the
range of positions beginning with the nucleotide at about the position of the 5'
Nucleotide of the First Amino Acid of the Signal Peptide and ending with the nucleotide
at about the position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID
NO:X in Table 1.

Also preferred is an isolated nucleic acid molecule comprising a nucleotide
sequence which is at least 95% identical to a sequence of at least about 150 contiguous
nucleotides in the nucleotide sequence of SEQ ID NO:X.

Further preferred is an isolated nucleic acid molecule comprising a nucleotide
sequence which is at least 95% identical to a sequence of at least about 500 contiguous
nucleotides in the nucleotide sequence of SEQ ID NO:X.

A further preferred embodiment is a nucleic acid molecule comprising a
nucleotide sequence which is at least 95% identical to the nucleotide sequence of SEQ
ID NO:X beginning with the nucleotide at about the position of the 5' Nucleotide of the
First Amino Acid of the Signal Peptide and ending with the nucleotide at about the
position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in
Table 1.






A further preferred embodiment is an isolated nucleic acid molecule comprising
a nucleotide sequence which is at least 95% identical to the complete nucleotide
sequence of SEQ ID NO:X.

Also preferred is an isolated nucleic acid molecule which hybridizes under
stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic acid
molecule which hybridizes does not hybridize under stringent hybridization conditions
to a nucleic acid molecule having a nucleotide sequence consisting of only A residues or
of only T residues.

Also preferred is a composition of matter comprising a DNA molecule which
comprises a human cDNA clone identified by a cDNA Clone Identifier in Table 1,
which DNA molecule is contained in the material deposited with the American Type
Culture Collection and given the ATCC Deposit Number shown in Table 1 for said
cDNA Clone Identifier.

Also preferred is an isolated nucleic acid molecule comprising a nucleotide
sequence which is at least 95% identical to a sequence of at least 50 contiguous
nucleotides in the nucleotide sequence of a human cDNA clone identified by a cDNA
Clone Identifier in Table 1, which DNA molecule is contained in the deposit given the
ATCC Deposit Number shown in Table 1.

Also preferred is an isolated nucleic acid molecule, wherein said sequence of at
least 50 contiguous nucleotides is included in the nucleotide sequence of the complete
open reading frame sequence encoded by said human cDNA clone.

Also preferred is an isolated nucleic acid molecule comprising a nucleotide
sequence which is at least 95% identical to a sequence of at least 150 contiguous
nucleotides in the nucleotide sequence encoded by said human cDNA clone.

A further preferred embodiment is an isolated nucleic acid molecule comprising
a nucleotide sequence which is at least 95% identical to a sequence of at least 500
contiguous nucleotides in the nucleotide sequence encoded by said human cDNA clone.

A further preferred embodiment is an isolated nucleic acid molecule comprising
a nucleotide sequence which is at least 95% identical to the complete nucleotide
sequence encoded by said human cDNA clone.

A further preferred embodiment is a method for detecting in a biological sample
a nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical
to a sequence of at least 50 contiguous nucleotides in a sequence selected from the
group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any integer
as defined in Table 1; and a nucleotide sequence encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1; which method
comprises a step of comparing a nucleotide sequence of at least one nucleic acid
molecule in said sample with a sequence selected from said group and determining
whether the sequence of said nucleic acid molecule in said sample is at least 95%
identical to said selected sequence.

Also preferred is the above method wherein said step of comparing sequences
comprises determining the extent of nucleic acid hybridization between nucleic acid
molecules in said sample and a nucleic acid molecule comprising said sequence selected
from said group. Similarly, also preferred is the above method wherein said step of
comparing sequences is performed by comparing the nucleotide sequence determined
from a nucleic acid molecule in said sample with said sequence selected from said
group. The nucleic acid molecules can comprise DNA molecules or RNA molecules.

A further preferred embodiment is a method for identifying the species, tissue or
cell type of a biological sample which method comprises a step of detecting nucleic acid
molecules in said sample, if any, comprising a nucleotide sequence that is at least 95%
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from
the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any
integer as defined in Table 1; and a nucleotide sequence encoded by a human cDNA
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with
the ATCC Deposit Number shown for said cDNA clone in Table 1.

The method for identifying the species, tissue or cell type of a biological sample
can comprise a step of detecting nucleic acid molecules comprising a nucleotide
sequence in a panel of at least two nucleotide sequences, wherein at least one sequence
in said panel is at least 95% identical to a sequence of at least 50 contiguous nucleotides
in a sequence selected from said group.

Also preferred is a method for diagnosing in a subject a pathological condition
associated with abnormal structure or expression of a gene encoding a secreted protein
identified in Table 1, which method comprises a step of detecting in a biological sample
obtained from said subject nucleic acid molecules, if any, comprising a nucleotide
sequence that is at least 95% identical to a sequence of at least 50 contiguous
nucleotides in a sequence selected from the group consisting of: a nucleotide sequence
of SEQ ID NO:X wherein X is any integer as defined in Table 1; and a nucleotide
sequence encoded by a human cDNA clone identified by a cDNA Clone Identifier in
Table 1 and contained in the deposit with the ATCC Deposit Number shown for said
cDNA clone in Table 1.

The method for diagnosing a pathological condition can comprise a step of
detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at least
two nucleotide sequences, wherein at least one sequence in said panel is at least 95%
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from
said group.

Also preferred is a composition of matter comprising isolated nucleic acid
molecules wherein the nucleotide sequences of said nucleic acid molecules comprise a
panel of at least two nucleotide sequences, wherein at least one sequence in said panel is
at least 95% identical to a sequence of at least 50 contiguous nucleotides in a sequence
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein
X is any integer as defined in Table 1; and a nucleotide sequence encoded by a human
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. The
nucleic acid molecules can comprise DNA molecules or RNA molecules.

Also preferred is an isolated polypeptide comprising an amino acid sequence at
least 90% identical to a sequence of at least about 10 contiguous amino acids in the
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1.

Also preferred is a polypeptide, wherein said sequence of contiguous amino
acids is included in the amino acid sequence of SEQ ID NO:Y in the range of positions
beginning with the residue at about the position of the First Amino Acid of the Secreted
Portion and ending with the residue at about the Last Amino Acid of the Open Reading
Frame as set forth for SEQ ID NO:Y in Table 1.

Also preferred is an isolated polypeptide comprising an amino acid sequence at
least 95% identical to a sequence of at least about 30 contiguous amino acids in the
amino acid sequence of SEQ ID NO:Y.

Further preferred is an isolated polypeptide comprising an amino acid sequence
at least 95% identical to a sequence of at least about 100 contiguous amino acids in the
amino acid sequence of SEQ ID NO:Y.

Further preferred is an isolated polypeptide comprising an amino acid sequence
at least 95% identical to the complete amino acid sequence of SEQ ID NO:Y.

Further preferred is an isolated polypeptide comprising an amino acid sequence
at least 90% identical to a sequence of at least about 10 contiguous amino acids in the
complete amino acid sequence of a secreted protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is a polypeptide wherein said sequence of contiguous amino
acids is included in the amino acid sequence of a secreted portion of the secreted protein
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in
Table 1.

Also preferred is an isolated polypeptide comprising an amino acid sequence at
least 95% identical to a sequence of at least about 30 contiguous amino acids in the
amino acid sequence of the secreted portion of the protein encoded by a human cDNA
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with
the ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is an isolated polypeptide comprising an amino acid sequence at
least 95% identical to a sequence of at least about 100 contiguous amino acids in the
amino acid sequence of the secreted portion of the protein encoded by a human cDNA
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with
the ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is an isolated polypeptide comprising an amino acid sequence at
least 95% identical to the amino acid sequence of the secreted portion of the protein
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in
Table 1.

Further preferred is an isolated antibody which binds specifically to a
polypeptide comprising an amino acid sequence that is at least 90% identical to a
sequence of at least 10 contiguous amino acids in a sequence selected from the group
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as
defined in Table 1; and a complete amino acid sequence of a protein encoded by a
human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in
the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1.

Further preferred is a method for detecting in a biological sample a polypeptide
comprising an amino acid sequence which is at least 90% identical to a sequence of at
least 10 contiguous amino acids in a sequence selected from the group consisting of: an
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1;
and a complete amino acid sequence of a protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1; which method
comprises a step of comparing an amino acid sequence of at least one polypeptide
molecule in said sample with a sequence selected from said group and determining
whether the sequence of said polypeptide molecule in said sample is at least 90%
identical to said sequence of at least 10 contiguous amino acids.

Also preferred is the above method wherein said step of comparing an amino
acid sequence of at least one polypeptide molecule in said sample with a sequence
selected from said group comprises determining the extent of specific binding of
polypeptides in said sample to an antibody which binds specifically to a polypeptide
comprising an amino acid sequence that is at least 90% identical to a sequence of at least
10 contiguous amino acids in a sequence selected from the group consisting of: an
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1;
and a complete amino acid sequence of a protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is the above method wherein said step of comparing sequences is
performed by comparing the amino acid sequence determined from a polypeptide
molecule in said sample with said sequence selected from said group.

Also preferred is a method for identifying the species, tissue or cell type of a
biological sample which method comprises a step of detecting polypeptide molecules in
said sample, if any, comprising an amino acid sequence that is at least 90% identical to
a sequence of at least 10 contiguous amino acids in a sequence selected from the group
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as
defined in Table 1; and a complete amino acid sequence of a secreted protein encoded
by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is the above method for identifying the species, tissue or cell type
of a biological sample, which method comprises a step of detecting polypeptide
molecules comprising an amino acid sequence in a panel of at least two amino acid
sequences, wherein at least one sequence in said panel is at least 90% identical to a
sequence of at least 10 contiguous amino acids in a sequence selected from the above
group.

Also preferred is a method for diagnosing in a subject a pathological condition
associated with abnormal structure or expression of a gene encoding a secreted protein
identified in Table 1, which method comprises a step of detecting in a biological sample
obtained from said subject polypeptide molecules comprising an amino acid sequence in
a panel of at least two amino acid sequences, wherein at least one sequence in said panel
is at least 90% identical to a sequence of at least 10 contiguous amino acids in a
sequence selected from the group consisting of: an amino acid sequence of SEQ ID
NO:Y wherein Y is any integer as defined in Table 1; and a complete amino acid
sequence of a secreted protein encoded by a human cDNA clone identified by a cDNA
Clone Identifier in Table 1 and contained in the deposit with the ATCC Deposit Number
shown for said cDNA clone in Table 1.

In any of these methods, the step of detecting said polypeptide molecules
includes using an antibody.

Also preferred is an isolated nucleic acid molecule comprising a nucleotide
sequence which is at least 95% identical to a nucleotide sequence encoding a
polypeptide wherein said polypeptide comprises an amino acid sequence that is at least
90% identical to a sequence of at least 10 contiguous amino acids in a sequence selected
from the group consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is
any integer as defined in Table 1; and a complete amino acid sequence of a secreted
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table
1 and contained in the deposit with the ATCC Deposit Number shown for said cDNA
clone in Table 1.

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide
sequence encoding a polypeptide has been optimized for expression of said polypeptide
in a prokaryotic host.

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide
comprises an amino acid sequence selected from the group consisting of: an amino acid
sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a
complete amino acid sequence of a secreted protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1.

Further preferred is a method of making a recombinant vector comprising
inserting any of the above isolated nucleic acid molecules into a vector. Also preferred is
the recombinant vector produced by this method. Also preferred is a method of making
a recombinant host cell comprising introducing the vector into a host cell, as well as the
recombinant host cell produced by this method.

Also preferred is a method of making an isolated polypeptide comprising
culturing this recombinant host cell under conditions such that said polypeptide is
expressed and recovering said polypeptide. Also preferred is this method of making an
isolated polypeptide, wherein said recombinant host cell is a eukaryotic cell and said
polypeptide is a secreted portion of a human secreted protein comprising an amino acid
sequence selected from the group consisting of: an amino acid sequence of SEQ ID
NO:Y beginning with the residue at the position of the First Amino Acid of the Secreted
Portion of SEQ ID NO:Y wherein Y is an integer set forth in Table 1 and said position
of the First Amino Acid of the Secreted Portion of SEQ ID NO:Y is defined in Table 1;
and an amino acid sequence of a secreted portion of a protein encoded by a human
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. The
isolated polypeptide produced by this method is also preferred.






Also preferred is a method of treatment of an individual in need of an increased
level of a secreted protein activity, which method comprises administering to such an
individual a pharmaceutical composition comprising an amount of an isolated
polypeptide, polynucleotide, or antibody of the claimed invention effective to increase
the level of said protein activity in said individual.

Having generally described the invention, the same will be more readily
understood by reference to the following examples, which are provided by way of
illustration and are not intended as limiting.

Examples

Example 1: Isolation of a Selected cDNA Clone From the Deposited 
Sample 

Each cDNA clone in a cited ATCC deposit is contained in a plasmid vector.
Table 1 identifies the vectors used to construct the cDNA library from which each clone
was isolated. In many cases, the vector used to construct the library is a phage vector
from which a plasmid has been excised. The table immediately below correlates the
related plasmid for each phage vector used in constructing the cDNA library. For
example, where a particular clone is identified in Table 1 as being isolated in the vector
"Lambda Zap," the corresponding deposited clone is in "pBluescript."

Vector Used to Construct Library    Corresponding Deposited Plasmid
Lambda Zap                          pBluescript (pBS)
Uni-Zap XR                          pBluescript (pBS)
Zap Express                         pBK
lafmid BA                           plafmid BA
pSport1                             pSport1
pCMVSport 2.0                       pCMVSport 2.0
pCMVSport 3.0                       pCMVSport 3.0
pCR®2.1                             pCR®2.1
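
The vector-to-plasmid correspondence in the table above is a simple lookup, which can be sketched in Python as follows. The dictionary and function names are hypothetical; the entries are transcribed from the table.

```python
# Library vector (as named in Table 1) -> plasmid present in the ATCC deposit.
DEPOSITED_PLASMID = {
    "Lambda Zap": "pBluescript (pBS)",
    "Uni-Zap XR": "pBluescript (pBS)",
    "Zap Express": "pBK",
    "lafmid BA": "plafmid BA",
    "pSport1": "pSport1",
    "pCMVSport 2.0": "pCMVSport 2.0",
    "pCMVSport 3.0": "pCMVSport 3.0",
    "pCR®2.1": "pCR®2.1",
}


def deposited_plasmid(library_vector):
    """Return the deposited plasmid for a library vector (hypothetical helper)."""
    try:
        return DEPOSITED_PLASMID[library_vector]
    except KeyError:
        raise KeyError(f"vector not listed in the table: {library_vector!r}")


if __name__ == "__main__":
    print(deposited_plasmid("Lambda Zap"))
```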

Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap
XR (U.S. Patent Nos. 5,128,256 and 5,286,636), Zap Express (U.S. Patent Nos.
5,128,256 and 5,286,636), pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res.
16:7583-7600 (1988); Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res.
17:9494 (1989)) and pBK (Alting-Mees, M. A. et al., Strategies 5:58-61 (1992)) are
commercially available from Stratagene Cloning Systems, Inc., 11011 N. Torrey Pines
Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and pBK
contains a neomycin resistance gene. Both can be transformed into E. coli strain XL-1
Blue, also available from Stratagene. pBS comes in 4 forms: SK+, SK-, KS+ and KS-.
The S and K refer to the orientation of the polylinker to the T7 and T3 primer
sequences which flank the polylinker region ("S" is for SacI and "K" is for KpnI, which
are the first sites on each respective end of the linker). "+" or "-" refer to the orientation
of the f1 origin of replication ("ori"), such that in one orientation, single stranded rescue
initiated from the f1 ori generates sense strand DNA and in the other, antisense.

Vectors pSport1, pCMVSport 2.0 and pCMVSport 3.0 were obtained from
Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors
contain an ampicillin resistance gene and may be transformed into E. coli strain
DH10B, also available from Life Technologies. (See, for instance, Gruber, C. E., et
al., Focus 15:59 (1993).) Vector lafmid BA (Bento Soares, Columbia University, NY)
contains an ampicillin resistance gene and can be transformed into E. coli strain XL-1
Blue. Vector pCR®2.1, which is available from Invitrogen, 1600 Faraday Avenue,
Carlsbad, CA 92008, contains an ampicillin resistance gene and may be transformed
into E. coli strain DH10B, available from Life Technologies. (See, for instance, Clark,
J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead, D. et al., Bio/Technology 9:
(1991).) Preferably, a polynucleotide of the present invention does not comprise the
phage vector sequences identified for the particular clone in Table 1, as well as the
corresponding plasmid vector sequences designated above.

The deposited material in the sample assigned the ATCC Deposit Number cited
in Table 1 for any given cDNA clone also may contain one or more additional plasmids,
each comprising a cDNA clone different from that given clone. Thus, deposits sharing
the same ATCC Deposit Number contain at least a plasmid for each cDNA clone
identified in Table 1. Typically, each ATCC deposit sample cited in Table 1 comprises
a mixture of approximately equal amounts (by weight) of about 50 plasmid DNAs, each
containing a different cDNA clone; but such a deposit sample may include plasmids for
more or less than 50 cDNA clones, up to about 500 cDNA clones.

Two approaches can be used to isolate a particular clone from the deposited
sample of plasmid DNAs cited for that clone in Table 1. First, a plasmid is directly
isolated by screening the clones using a polynucleotide probe corresponding to SEQ ID
NO:X.

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized
using an Applied Biosystems DNA synthesizer according to the sequence reported.
The oligonucleotide is labeled, for instance, with 32P-γ-ATP using T4 polynucleotide
kinase and purified according to routine methods. (E.g., Maniatis et al., Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).)
The plasmid mixture is transformed into a suitable host, as indicated above (such as
XL-1 Blue (Stratagene)) using techniques known to those of skill in the art, such as
those provided by the vector supplier or in related publications or patents cited above.
The transformants are plated on 1.5% agar plates (containing the appropriate selection
agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate.
These plates are screened using Nylon membranes according to routine methods for
bacterial colony screening (e.g., Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd Edit., (1989), Cold Spring Harbor Laboratory Press, pages 1.93 to
1.104), or other techniques known to those of skill in the art.
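
As a rough guide to how many plates such a colony screen might require, the
following sketch estimates the chance of seeing the target clone at least once. It
assumes (this is not stated in the text) that every plasmid in the deposit transforms
equally well, so each colony carries the target with probability 1/N. The function
names are illustrative only.

```python
import math


def prob_target_seen(n_clones: int, plates: int, colonies_per_plate: int = 150) -> float:
    """Chance that at least one screened colony carries the target clone.

    Assumption: each colony independently carries the target with
    probability 1 / n_clones (equal representation, equal transformation).
    """
    total_colonies = plates * colonies_per_plate
    return 1.0 - (1.0 - 1.0 / n_clones) ** total_colonies


def plates_needed(n_clones: int, confidence: float = 0.99,
                  colonies_per_plate: int = 150) -> int:
    """Smallest plate count giving at least `confidence` chance of a hit."""
    if n_clones < 2:
        return 1  # a single-clone sample needs only one plate
    colonies = math.log(1.0 - confidence) / math.log(1.0 - 1.0 / n_clones)
    return max(1, math.ceil(colonies / colonies_per_plate))


if __name__ == "__main__":
    for n in (50, 500):  # typical and upper-bound deposit sizes from the text
        k = plates_needed(n)
        print(f"{n} clones: {k} plates, P(hit) = {prob_target_seen(n, k):.4f}")
```

Under these assumptions a deposit of about 50 clones needs only a few plates at 150
colonies each, while a deposit near the 500-clone upper bound needs correspondingly
more.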

Alternatively, two primers of 17-20 nucleotides derived from both ends of the
SEQ ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5' NT and the
3' NT of the clone defined in Table 1) are synthesized and used to amplify the desired
cDNA using the deposited cDNA plasmid as a template. The polymerase chain reaction
is carried out under routine conditions, for instance, in 25 µl of reaction mixture with
0.5 µg of the above cDNA template. A convenient reaction mixture is 1.5-5 mM
MgCl2, 0.01% (w/v) gelatin, 20 µM each of dATP, dCTP, dGTP, dTTP, 25 pmol of
each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation
at 94°C for 1 min; annealing at 55°C for 1 min; elongation at 72°C for 1 min) are
performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified product
is analyzed by agarose gel electrophoresis and the DNA band with expected molecular
weight is excised and purified. The PCR product is verified to be the selected sequence
by subcloning and sequencing the DNA product.
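
The choice of end primers described above can be sketched as follows. This is a
minimal illustration, not the procedure actually used: the function names and the
1-based, inclusive coordinate convention for the 5' NT and 3' NT positions are
assumptions, and no melting-temperature or secondary-structure checks are made.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def end_primers(seq: str, five_nt: int, three_nt: int, length: int = 20):
    """Return (forward, reverse) primers for the region five_nt..three_nt.

    Coordinates are assumed to be 1-based and inclusive. The forward primer
    copies the first `length` bases of the region; the reverse primer is the
    reverse complement of the last `length` bases.
    """
    if not 17 <= length <= 20:
        raise ValueError("the text calls for primers of 17-20 nucleotides")
    region = seq[five_nt - 1:three_nt]
    if len(region) < 2 * length:
        raise ValueError("region too short for two non-overlapping primers")
    forward = region[:length]
    reverse = reverse_complement(region[-length:])
    return forward, reverse


if __name__ == "__main__":
    # Hypothetical toy sequence, used only to show the call.
    toy = "A" * 10 + "C" * 20 + "G" * 30 + "T" * 20 + "A" * 10
    print(end_primers(toy, five_nt=11, three_nt=80))
```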

Several methods are available for the identification of the 5' or 3' non-coding
portions of a gene which may not be present in the deposited clone. These methods
include, but are not limited to, filter probing, clone enrichment using specific probes,
and protocols similar or identical to 5' and 3' "RACE" protocols which are well known
in the art. For instance, a method similar to 5' RACE is available for generating the
missing 5' end of a desired full-length transcript. (Fromont-Racine et al., Nucleic Acids
Res. 21(7):1683-1684 (1993).)

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a population
of RNA presumably containing full-length gene RNA transcripts. A primer set
containing a primer specific to the ligated RNA oligonucleotide and a primer specific to
a known sequence of the gene of interest is used to PCR amplify the 5' portion of the
desired full-length gene. This amplified product may then be sequenced and used to
generate the full length gene.

This above method starts with total RNA isolated from the desired source,
although poly-A+ RNA can be used. The RNA preparation can then be treated with
phosphatase if necessary to eliminate 5' phosphate groups on degraded or damaged
RNA which may interfere with the later RNA ligase step. The phosphatase should then
be inactivated and the RNA treated with tobacco acid pyrophosphatase in order to
remove the cap structure present at the 5' ends of messenger RNAs. This reaction
leaves a 5' phosphate group at the 5' end of the cap cleaved RNA, which can then be
ligated to an RNA oligonucleotide using T4 RNA ligase.

This modified RNA preparation is used as a template for first strand cDNA
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is
used as a template for PCR amplification of the desired 5' end using a primer specific to
the ligated RNA oligonucleotide and a primer specific to the known sequence of the
gene of interest. The resultant product is then sequenced and analyzed to confirm that
the 5' end sequence belongs to the desired gene.

Example 2: Isolation of Genomic Clones Corresponding to a 
Polynucleotide 

A human genomic P1 library (Genomic Systems, Inc.) is screened by PCR
using primers selected for the cDNA sequence corresponding to SEQ ID NO:X,
according to the method described in Example 1. (See also, Sambrook.)

Example 3: Tissue Distribution of Polypeptide 

Tissue distribution of mRNA expression of polynucleotides of the present
invention is determined using protocols for Northern blot analysis, described by,
among others, Sambrook et al. For example, a cDNA probe produced by the method
described in Example 1 is labeled with P32 using the rediprime™ DNA labeling system
(Amersham Life Science), according to manufacturer's instructions. After labeling, the
probe is purified using CHROMA SPIN-100™ column (Clontech Laboratories, Inc.),
according to manufacturer's protocol number PT1200-1. The purified labeled probe is
then used to examine various human tissues for mRNA expression.

Multiple Tissue Northern (MTN) blots containing various human tissues (H) or
human immune system tissues (IM) (Clontech) are examined with the labeled probe
using ExpressHyb™ hybridization solution (Clontech) according to manufacturer's
protocol number PT1190-1. Following hybridization and washing, the blots are
mounted and exposed to film at -70°C overnight, and the films developed according to
standard procedures.

Example 4: Chromosomal Mapping of the Polynucleotides

An oligonucleotide primer set is designed according to the sequence at the 5'
end of SEQ ID NO:X. This primer preferably spans about 100 nucleotides. This
primer set is then used in a polymerase chain reaction under the following set of
conditions: 30 seconds, 95°C; 1 minute, 56°C; 1 minute, 70°C. This cycle is repeated
32 times followed by one 5 minute cycle at 70°C. Human, mouse, and hamster DNA
is used as template in addition to a somatic cell hybrid panel containing individual
chromosomes or chromosome fragments (Bios, Inc). The reactions are analyzed on
either 8% polyacrylamide gels or 3.5% agarose gels. Chromosome mapping is
determined by the presence of an approximately 100 bp PCR fragment in the particular
somatic cell hybrid.
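
The assignment logic implied above (a chromosome is implicated when the roughly
100 bp fragment appears in exactly those hybrids that carry it) can be sketched as
below. The panel layout, hybrid names and function name are hypothetical; a real
panel would also need to account for chromosome fragments and failed reactions,
which this sketch ignores.

```python
def assign_chromosome(panel, positives):
    """Return chromosomes consistent with the observed PCR results.

    panel     -- dict mapping hybrid name to the set of human chromosomes it carries
    positives -- set of hybrid names that gave the ~100 bp fragment

    A chromosome is kept only if it is present in every positive hybrid
    and absent from every negative hybrid.
    """
    candidates = set().union(*panel.values())
    for hybrid, chromosomes in panel.items():
        if hybrid in positives:
            candidates &= chromosomes
        else:
            candidates -= chromosomes
    return candidates


if __name__ == "__main__":
    # Hypothetical three-hybrid panel, for illustration only.
    panel = {"H1": {"1", "7"}, "H2": {"7", "X"}, "H3": {"1", "X"}}
    print(assign_chromosome(panel, positives={"H1", "H2"}))
```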

Example 5: Bacterial Expression of a Polypeptide 

A polynucleotide encoding a polypeptide of the present invention is amplified
using PCR oligonucleotide primers corresponding to the 5' and 3' ends of the DNA
sequence, as outlined in Example 1, to synthesize insertion fragments. The primers
used to amplify the cDNA insert should preferably contain restriction sites, such as
BamHI and XbaI, at the 5' end of the primers in order to clone the amplified product
into the expression vector. For example, BamHI and XbaI correspond to the restriction
enzyme sites on the bacterial expression vector pQE-9 (Qiagen, Inc., Chatsworth,
CA). This plasmid vector encodes antibiotic resistance (Ampr), a bacterial origin of
replication (ori), an IPTG-regulatable promoter/operator (P/O), a ribosome binding site
(RBS), a 6-histidine tag (6-His), and restriction enzyme cloning sites.

The pQE-9 vector is digested with BamHI and XbaI and the amplified fragment
is ligated into the pQE-9 vector maintaining the reading frame initiated at the bacterial
RBS. The ligation mixture is then used to transform the E. coli strain M15/rep4
(Qiagen, Inc.) which contains multiple copies of the plasmid pREP4, which expresses
the lacI repressor and also confers kanamycin resistance (Kanr). Transformants are
identified by their ability to grow on LB plates and ampicillin/kanamycin resistant
colonies are selected. Plasmid DNA is isolated and confirmed by restriction analysis.

Clones containing the desired constructs are grown overnight (O/N) in liquid
culture in LB media supplemented with both Amp (100 µg/ml) and Kan (25 µg/ml).
The O/N culture is used to inoculate a large culture at a ratio of 1:100 to 1:250. The
cells are grown to an optical density 600 (O.D.600) of between 0.4 and 0.6. IPTG
(Isopropyl-B-D-thiogalacto pyranoside) is then added to a final concentration of 1 mM.
IPTG induces by inactivating the lacI repressor, clearing the P/O leading to increased
gene expression.
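
The volumes involved in the inoculation and induction steps can be worked out with
a short calculation. The sketch below is illustrative only: the function names are
invented here, and the IPTG stock concentration is an assumption supplied by the
caller, since the text gives only the final concentration.

```python
def inoculum_volume_ml(culture_ml: float, dilution: int) -> float:
    """Volume of overnight culture for a 1:dilution inoculation."""
    if not 100 <= dilution <= 250:
        raise ValueError("the text gives a ratio of 1:100 to 1:250")
    return culture_ml / dilution


def ready_to_induce(od600: float) -> bool:
    """True when the culture is within the O.D.600 window of 0.4 to 0.6."""
    return 0.4 <= od600 <= 0.6


def iptg_stock_volume_ml(culture_ml: float, stock_mM: float,
                         final_mM: float = 1.0) -> float:
    """Stock volume v solving stock_mM * v = final_mM * (culture_ml + v)."""
    if stock_mM <= final_mM:
        raise ValueError("stock must be more concentrated than the target")
    return final_mM * culture_ml / (stock_mM - final_mM)


if __name__ == "__main__":
    # Assumed example: 1 litre culture and a 1 M (1000 mM) IPTG stock.
    print(inoculum_volume_ml(1000, 100), inoculum_volume_ml(1000, 250))
    print(round(iptg_stock_volume_ml(1000, 1000), 3))
```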

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by
centrifugation (20 mins at 6000Xg). The cell pellet is solubilized in the chaotropic
agent 6 Molar Guanidine HCl by stirring for 3-4 hours at 4°C. The cell debris is
removed by centrifugation, and the supernatant containing the polypeptide is loaded
onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from
QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high
affinity and can be purified in a simple one-step procedure (for details see: The
QIAexpressionist (1995) QIAGEN, Inc., supra).

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 8,
the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed
with 10 volumes of 6 M guanidine-HCl pH 6, and finally the polypeptide is eluted with
6 M guanidine-HCl, pH 5.

The purified protein is then renatured by dialyzing it against phosphate-buffered
saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the
protein can be successfully refolded while immobilized on the Ni-NTA column. The
recommended conditions are as follows: renature using a linear 6M-1M urea gradient in
500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, containing protease inhibitors.
The renaturation should be performed over a period of 1.5 hours or more. After
renaturation the proteins are eluted by the addition of 250 mM imidazole. Imidazole
is removed by a final dialyzing step against PBS or 50 mM sodium acetate pH 6 buffer
plus 200 mM NaCl. The purified protein is stored at 4°C or frozen at -80°C.

In addition to the above expression vector, the present invention further includes
an expression vector comprising phage operator and promoter elements operatively
linked to a polynucleotide of the present invention, called pHE4a. (ATCC Accession
Number 209645, deposited on February 25, 1998.) This vector contains: 1) a
neomycinphosphotransferase gene as a selection marker, 2) an E. coli origin of
replication, 3) a T5 phage promoter sequence, 4) two lac operator sequences, 5) a
Shine-Dalgarno sequence, and 6) the lactose operon repressor gene (lacIq). The origin
of replication (oriC) is derived from pUC19 (LTI, Gaithersburg, MD). The promoter
sequence and operator sequences are made synthetically.

DNA can be inserted into the pHE4a by restricting the vector with NdeI and
XbaI, BamHI, XhoI, or Asp718, running the restricted product on a gel, and isolating
the larger fragment (the stuffer fragment should be about 310 base pairs). The DNA
insert is generated according to the PCR protocol described in Example 1, using PCR
primers having restriction sites for NdeI (5' primer) and XbaI, BamHI, XhoI, or
Asp718 (3' primer). The PCR insert is gel purified and restricted with compatible
enzymes. The insert and vector are ligated according to standard protocols.

The engineered vector could easily be substituted in the above protocol to
express protein in a bacterial system.

Example 6: Purification of a Polypeptide from an Inclusion Body

The following alternative method can be used to purify a polypeptide expressed
in E. coli when it is present in the form of inclusion bodies. Unless otherwise specified,
all of the following steps are conducted at 4-10°C.

Upon completion of the production phase of the E. coli fermentation, the cell
culture is cooled to 4-10°C and the cells harvested by continuous centrifugation at
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit
weight of cell paste and the amount of purified protein required, an appropriate amount
of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50
mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a
high shear mixer.

The cells are then lysed by passing the solution through a microfluidizer
(Microfluidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is
then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by
centrifugation at 7000 xg for 15 min. The resultant pellet is washed again using 0.5M
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4.

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine
hydrochloride (GuHCl) for 2-4 hours. After 7000 xg centrifugation for 15 min., the
pellet is discarded and the polypeptide containing supernatant is incubated at 4°C
overnight to allow further GuHCl extraction.

Following high speed centrifugation (30,000 xg) to remove insoluble particles,
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20
volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by
vigorous stirring. The refolded diluted protein solution is kept at 4°C without mixing
for 12 hours prior to further purification steps.
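
The effect of the rapid dilution on the denaturant can be estimated with a simple
dilution calculation. The sketch below assumes ideal mixing and that the refolding
buffer itself contains no GuHCl; the function names are illustrative.

```python
def refold_buffer_volume_ml(extract_ml: float, volumes: float = 20.0) -> float:
    """Buffer volume needed to dilute the extract with `volumes` volumes of buffer."""
    return extract_ml * volumes


def diluted_concentration(initial_molar: float, volumes: float = 20.0) -> float:
    """Concentration after mixing 1 volume of extract with `volumes` volumes of buffer."""
    return initial_molar / (1.0 + volumes)


if __name__ == "__main__":
    # 1.5 M GuHCl extract diluted with 20 volumes of refolding buffer.
    print(refold_buffer_volume_ml(50.0))
    print(round(diluted_concentration(1.5), 4))
```

On these assumptions, one volume of 1.5 M GuHCl extract mixed with 20 volumes of
buffer ends up at 1.5/21 M, roughly 0.07 M GuHCl.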

To clarify the refolded polypeptide solution, a previously prepared tangential
filtration unit equipped with 0.16 µm membrane filter with appropriate surface area
(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted
with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a
stepwise manner. The absorbance at 280 nm of the effluent is continuously monitored.
Fractions are collected and further analyzed by SDS-PAGE.

Fractions containing the polypeptide are then pooled and mixed with 4 volumes
of water. The diluted sample is then loaded onto a previously prepared set of tandem
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion
(Poros CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated
with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium
acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column
volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1.0
M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected under constant A280
monitoring of the effluent. Fractions containing the polypeptide (determined, for
instance, by 16% SDS-PAGE) are then pooled.
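
For relating a collected fraction to the salt concentration at which it eluted, the
linear gradient above can be written as a simple interpolation. This sketch ignores
column dead volume and mixing delay, which a real system would add; the function
name is illustrative.

```python
def gradient_nacl_molar(cv_elapsed: float, start: float = 0.2,
                        end: float = 1.0, total_cv: float = 10.0) -> float:
    """NaCl concentration (M) delivered after `cv_elapsed` column volumes.

    Linear interpolation from `start` to `end` over `total_cv` column volumes.
    """
    if not 0.0 <= cv_elapsed <= total_cv:
        raise ValueError("cv_elapsed must lie within the gradient")
    return start + (end - start) * cv_elapsed / total_cv


if __name__ == "__main__":
    for cv in (0, 2.5, 5, 7.5, 10):
        print(cv, round(gradient_nacl_molar(cv), 3))
```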

The resultant polypeptide should exhibit greater than 95% purity after the above
refolding and purification steps. No major contaminant bands should be observed from
Coomassie blue stained 16% SDS-PAGE gel when 5 µg of purified protein is loaded.
The purified protein can also be tested for endotoxin/LPS contamination, and typically
the LPS content is less than 0.1 ng/ml according to LAL assays.

Example 7: Cloning and Expression of a Polypeptide in a Baculovirus 
Expression System 

In this example, the plasmid shuttle vector pA2 is used to insert a polynucleotide
into a baculovirus to express a polypeptide. This expression vector contains the strong
polyhedrin promoter of the Autographa californica nuclear polyhedrosis virus
(AcMNPV) followed by convenient restriction sites such as BamHI, Xba I and
Asp718. The polyadenylation site of the simian virus 40 ("SV40") is used for efficient
polyadenylation. For easy selection of recombinant virus, the plasmid contains the
beta-galactosidase gene from E. coli under control of a weak Drosophila promoter in the
same orientation, followed by the polyadenylation signal of the polyhedrin gene. The
inserted genes are flanked on both sides by viral sequences for cell-mediated
homologous recombination with wild-type viral DNA to generate a viable virus that
expresses the cloned polynucleotide.

Many other baculovirus vectors can be used in place of the vector above, such
as pAc373, pVL941, and pAcIM1, as one skilled in the art would readily appreciate, as
long as the construct provides appropriately located signals for transcription,
translation, secretion and the like, including a signal peptide and an in-frame AUG as
required. Such vectors are described, for instance, in Luckow et al., Virology 170:31-39
(1989).

Specifically, the cDNA sequence contained in the deposited clone, including the
AUG initiation codon and the naturally associated leader sequence identified in Table 1,
is amplified using the PCR protocol described in Example 1. If the naturally occurring
signal sequence is used to produce the secreted protein, the pA2 vector does not need a
second signal peptide. Alternatively, the vector can be modified (pA2 GP) to include a
baculovirus leader sequence, using the standard methods described in Summers et al.,
"A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures,"
Texas Agricultural Experimental Station Bulletin No. 1555 (1987).

The amplified fragment is isolated from a 1% agarose gel using a commercially
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested
with appropriate restriction enzymes and again purified on a 1% agarose gel.

The plasmid is digested with the corresponding restriction enzymes and
optionally, can be dephosphorylated using calf intestinal phosphatase, using routine
procedures known in the art. The DNA is then isolated from a 1% agarose gel using a
commercially available kit ("Geneclean" BIO 101 Inc., La Jolla, Ca.).

The fragment and the dephosphorylated plasmid are ligated together with T4
DNA ligase. E. coli HB101 or other suitable E. coli hosts such as XL-1 Blue
(Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the ligation
mixture and spread on culture plates. Bacteria containing the plasmid are identified by
digesting DNA from individual colonies and analyzing the digestion product by gel
electrophoresis. The sequence of the cloned fragment is confirmed by DNA
sequencing.

Five µg of a plasmid containing the polynucleotide is co-transfected with 1.0 µg
of a commercially available linearized baculovirus DNA ("BaculoGold™ baculovirus
DNA", Pharmingen, San Diego, CA), using the lipofection method described by
Felgner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One µg of
BaculoGold™ virus DNA and 5 µg of the plasmid are mixed in a sterile well of a
microtiter plate containing 50 µl of serum-free Grace's medium (Life Technologies
Inc., Gaithersburg, MD). Afterwards, 10 µl Lipofectin plus 90 µl Grace's medium are
added, mixed and incubated for 15 minutes at room temperature. Then the transfection
mixture is added drop-wise to Sf9 insect cells (ATCC CRL 1711) seeded in a 35 mm
tissue culture plate with 1 ml Grace's medium without serum. The plate is then
incubated for 5 hours at 27°C. The transfection solution is then removed from the plate
and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum is added.
Cultivation is then continued at 27°C for four days.

After four days the supernatant is collected and a plaque assay is performed, as
described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of
gal-expressing clones, which produce blue-stained plaques. (A detailed description of a
"plaque assay" of this type can also be found in the user's guide for insect cell culture
and baculovirology distributed by Life Technologies Inc., Gaithersburg, page 9-10.)
After appropriate incubation, blue stained plaques are picked with the tip of a
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses is then
resuspended in a microcentrifuge tube containing 200 µl of Grace's medium and the
suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded in
35 mm dishes. Four days later the supernatants of these culture dishes are harvested
and then they are stored at 4°C.

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's
medium supplemented with 10% heat-inactivated FBS. The cells are infected with the
recombinant baculovirus containing the polynucleotide at a multiplicity of infection
("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later the medium is
removed and is replaced with SF900 II medium minus methionine and cysteine
(available from Life Technologies Inc., Rockville, MD). After 42 hours, 5 µCi of
35S-methionine and 5 µCi 35S-cysteine (available from Amersham) are added. The cells are
further incubated for 16 hours and then are harvested by centrifugation. The proteins
in the supernatant as well as the intracellular proteins are analyzed by SDS-PAGE
followed by autoradiography (if radiolabeled).
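
The amount of virus stock needed for a given multiplicity of infection follows from
the definition of MOI (infectious particles per cell). In the sketch below the cell
count and the titer in plaque-forming units per ml are assumed example inputs, as
the text specifies neither; the function name is illustrative.

```python
def virus_volume_ml(moi: float, cell_count: float, titer_pfu_per_ml: float) -> float:
    """Volume of virus stock that delivers `moi` plaque-forming units per cell."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    return moi * cell_count / titer_pfu_per_ml


if __name__ == "__main__":
    # Assumed example: 1e6 Sf9 cells and a stock of 1e8 pfu/ml, at an MOI of 2.
    print(virus_volume_ml(moi=2, cell_count=1e6, titer_pfu_per_ml=1e8))
```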

Microsequencing of the amino acid sequence of the amino terminus of purified 
protein may be used to determine the amino terminal sequence of the produced 
protein. 






Example 8: Expression of a Polypeptide in Mammalian Cells

The polypeptide of the present invention can be expressed in a mammalian cell.
A typical mammalian expression vector contains a promoter element, which mediates
the initiation of transcription of mRNA, a protein coding sequence, and signals required
for the termination of transcription and polyadenylation of the transcript. Additional
elements include enhancers, Kozak sequences and intervening sequences flanked by
donor and acceptor sites for RNA splicing. Highly efficient transcription is achieved
with the early and late promoters from SV40, the long terminal repeats (LTRs) from
Retroviruses, e.g., RSV, HTLVI, HIVI and the early promoter of the cytomegalovirus
(CMV). However, cellular elements can also be used (e.g., the human actin promoter).

Suitable expression vectors for use in practicing the present invention include,
for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden),
pRSVcat (ATCC 37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109),
pCMVSport 2.0, and pCMVSport 3.0. Mammalian host cells that could be used
include human Hela, 293, H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1,
Cos 7 and CV1, quail QC1-3 cells, mouse L cells and Chinese hamster ovary (CHO)
cells.

Alternatively, the polypeptide can be expressed in stable cell lines containing the
polynucleotide integrated into a chromosome. The co-transfection with a selectable
marker such as dhfr, gpt, neomycin, hygromycin allows the identification and isolation
of the transfected cells.

The transfected gene can also be amplified to express large amounts of the
encoded protein. The DHFR (dihydrofolate reductase) marker is useful in developing
cell lines that carry several hundred or even several thousand copies of the gene of
interest. (See, e.g., Alt, F. W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin,
J. L. and Ma, C., Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and
Sydenham, M. A., Biotechnology 9:64-68 (1991).) Another useful selection marker is
the enzyme glutamine synthase (GS) (Murphy et al., Biochem J. 227:277-279 (1991);
Bebbington et al., Bio/Technology 10:169-175 (1992)). Using these markers, the
mammalian cells are grown in selective medium and the cells with the highest resistance
are selected. These cell lines contain the amplified gene(s) integrated into a
chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the
production of proteins.

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession
No. 209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et
al., Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the
CMV-enhancer (Boshart et al., Cell 41:521-530 (1985)). Multiple cloning sites, e.g.,
with the restriction enzyme cleavage sites BamHI, XbaI and Asp718, facilitate the
cloning of the gene of interest. The vectors also contain the 3' intron, the
polyadenylation and termination signal of the rat preproinsulin gene, and the mouse
DHFR gene under control of the SV40 early promoter.

Specifically, the plasmid pC6, for example, is digested with appropriate
restriction enzymes and then dephosphorylated using calf intestinal phosphatase by
procedures known in the art. The vector is then isolated from a 1% agarose gel.

A polynucleotide of the present invention is amplified according to the protocol
outlined in Example 1. If the naturally occurring signal sequence is used to produce the
secreted protein, the vector does not need a second signal peptide. Alternatively, if the
naturally occurring signal sequence is not used, the vector can be modified to include a
heterologous signal sequence. (See, e.g., WO 96/34891.)

The amplified fragment is isolated from a 1% agarose gel using a commercially
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested
with appropriate restriction enzymes and again purified on a 1% agarose gel.

The amplified fragment is then digested with the same restriction enzyme and
purified on a 1% agarose gel. The isolated fragment and the dephosphorylated vector
are then ligated with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then
transformed and bacteria are identified that contain the fragment inserted into plasmid
pC6 using, for instance, restriction enzyme analysis.

Chinese hamster ovary cells lacking an active DHFR gene are used for
transfection. Five µg of the expression plasmid pC6 is cotransfected with 0.5 µg of the
plasmid pSVneo using lipofectin (Felgner et al., supra). The plasmid pSV2-neo
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme that
confers resistance to a group of antibiotics including G418. The cells are seeded in
alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus
MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml G418.
After about 10-14 days single clones are trypsinized and then seeded in 6-well petri
dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 100 nM,
200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of
methotrexate are then transferred to new 6-well plates containing even higher
concentrations of methotrexate (1 µM, 2 µM, 5 µM, 10 µM, 20 µM). The same
procedure is repeated until clones are obtained which grow at a concentration of
100-200 µM. Expression of the desired gene product is analyzed, for instance, by
SDS-PAGE and Western blot or by reversed phase HPLC analysis.



Example 9: Protein Fusions

The polypeptides of the present invention are preferably fused to other proteins.
These fusion proteins can be used for a variety of applications. For example, fusion of
the present polypeptides to His-tag, HA-tag, protein A, IgG domains, and maltose
binding protein facilitates purification. (See Example 5; see also EP A 394,827;
Traunecker, et al., Nature 331:84-86 (1988).) Similarly, fusion to IgG-1, IgG-3, and
albumin increases the half-life time in vivo. Nuclear localization signals fused to the
polypeptides of the present invention can target the protein to a specific subcellular
localization, while covalent heterodimer or homodimers can increase or decrease the
activity of a fusion protein. Fusion proteins can also create chimeric molecules having
more than one function. Finally, fusion proteins can increase solubility and/or stability
of the fused protein compared to the non-fused protein. All of the types of fusion
proteins described above can be made by modifying the following protocol, which
outlines the fusion of a polypeptide to an IgG molecule, or the protocol described in
Example 5.

Briefly, the human Fc portion of the IgG molecule can be PCR amplified, using
primers that span the 5' and 3' ends of the sequence described below. These primers
also should have convenient restriction enzyme sites that will facilitate cloning into an
expression vector, preferably a mammalian expression vector.

For example, if pC4 (Accession No. 209646) is used, the human Fc portion can
be ligated into the BamHI cloning site. Note that the 3' BamHI site should be
destroyed. Next, the vector containing the human Fc portion is re-restricted with
BamHI, linearizing the vector, and a polynucleotide of the present invention, isolated
by the PCR protocol described in Example 1, is ligated into this BamHI site. Note that
the polynucleotide is cloned without a stop codon, otherwise a fusion protein will not
be produced.
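
The requirement that the insert carry no stop codon can be checked before cloning
with a short script. This is a sketch under stated assumptions: the function names
are invented for illustration, the insert is taken to begin at its ATG, and the
reading frame across the cloning junction into the Fc portion (which depends on
the exact primer design) is not checked here.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def strip_terminal_stop(orf: str) -> str:
    """Remove a terminal stop codon from an in-frame coding sequence, if present."""
    orf = orf.upper()
    if len(orf) % 3 == 0 and orf[-3:] in STOP_CODONS:
        return orf[:-3]
    return orf


def fusion_ready(orf: str) -> bool:
    """True if the sequence starts with ATG, is whole codons, and has no in-frame stop."""
    orf = orf.upper()
    if len(orf) % 3 != 0 or not orf.startswith("ATG"):
        return False
    codons = (orf[i:i + 3] for i in range(0, len(orf), 3))
    return not any(codon in STOP_CODONS for codon in codons)


if __name__ == "__main__":
    # Hypothetical toy coding sequence ending in a stop codon.
    toy = "ATGGCCAAGTGA"
    print(fusion_ready(toy), fusion_ready(strip_terminal_stop(toy)))
```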

If the naturally occurring signal sequence is used to produce the secreted
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally
occurring signal sequence is not used, the vector can be modified to include a
heterologous signal sequence. (See, e.g., WO 96/34891.)

Human IgG Fc region:

GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCCACCGTGCC
CAGCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCTTCCCCCCAAAACC
CAAGGACACCCTCATGATCTCCCGGACTCCTGAGGTCACATGCGTGGTGGT
GGACGTAAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACG
GCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAAC
AGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTG
AATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAACCCCC
ATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGT
GTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCT
GACCTGCCTGGTCAAAGGCTTCTATCCAAGCGACATCGCCGTGGAGTGGGA
GAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGG
ACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCA
GGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGC
ACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATGAGTGC
GACGGCCGCGACTCTAGAGGAT (SEQ ID NO:1)

Example 10: Production of an Antibody from a Polypeptide 

The antibodies of the present invention can be prepared by a variety of methods.
(See, Current Protocols, Chapter 2.) For example, cells expressing a polypeptide of
the present invention are administered to an animal to induce the production of sera
containing polyclonal antibodies. In a preferred method, a preparation of the secreted
protein is prepared and purified to render it substantially free of natural contaminants.
Such a preparation is then introduced into an animal in order to produce polyclonal
antisera of greater specific activity.

In the most preferred method, the antibodies of the present invention are
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal
antibodies can be prepared using hybridoma technology. (Kohler et al., Nature
256:495 (1975); Kohler et al., Eur. J. Immunol. 6:511 (1976); Kohler et al., Eur. J.
Immunol. 6:292 (1976); Hammerling et al., in: Monoclonal Antibodies and T-Cell
Hybridomas, Elsevier, N.Y., pp. 563-681 (1981).) In general, such procedures
involve immunizing an animal (preferably a mouse) with polypeptide or, more
preferably, with a secreted polypeptide-expressing cell. Such cells may be cultured in
any suitable tissue culture medium; however, it is preferable to culture cells in Earle's
modified Eagle's medium supplemented with 10% fetal bovine serum (inactivated at
about 56°C), and supplemented with about 10 g/l of nonessential amino acids, about
1,000 U/ml of penicillin, and about 100 µg/ml of streptomycin.

The splenocytes of such mice are extracted and fused with a suitable myeloma 
cell line. Any suitable myeloma cell line may be employed in accordance with the 
present invention; however, it is preferable to employ the parent myeloma cell line 
(SP20), available from the ATCC. After fusion, the resulting hybridoma cells are 
selectively maintained in HAT medium, and then cloned by limiting dilution as 
described by Wands et al. (Gastroenterology 80:225-232 (1981)). The hybridoma cells 
obtained through such a selection are then assayed to identify clones which secrete 
antibodies capable of binding the polypeptide. 

Alternatively, additional antibodies capable of binding to the polypeptide can be 
produced in a two-step procedure using anti-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and therefore, it is 
possible to obtain an antibody which binds to a second antibody. In accordance with 
this method, protein specific antibodies are used to immunize an animal, preferably a 
mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an antibody 
whose ability to bind to the protein-specific antibody can be blocked by the polypeptide. 
Such antibodies comprise anti-idiotypic antibodies to the protein-specific antibody and 
can be used to immunize an animal to induce formation of further protein-specific 
antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the antibodies 
of the present invention may be used according to the methods disclosed herein. Such 
fragments are typically produced by proteolytic cleavage, using enzymes such as papain 
(to produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, 
secreted protein-binding fragments can be produced through the application of 
recombinant DNA technology or through synthetic chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced using 
genetic constructs derived from hybridoma cells producing the monoclonal antibodies 
described above. Methods for producing chimeric antibodies are known in the art. 
(See, for review, Morrison, Science 229:1202 (1985); Oi et al., BioTechniques 4:214 
(1986); Cabilly et al., U.S. Patent No. 4,816,567; Taniguchi et al., EP 171496; 
Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson et al., WO 
8702671; Boulianne et al., Nature 312:643 (1984); Neuberger et al., Nature 314:268 
(1985).) 

Example 11: Production Of Secreted Protein For High-Throughput 
Screening Assays 

The following protocol produces a supernatant containing a polypeptide to be 
tested. This supernatant can then be used in the Screening Assays described in 
Examples 13-20. 

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(1 mg/ml in PBS) 1:20 in PBS (w/o calcium or magnesium 17-516F Biowhittaker) for a 
working solution of 50 ug/ml. Add 200 ul of this solution to each well (24 well plates) 
and incubate at RT for 20 minutes. Be sure to distribute the solution over each well 
(note: a 12-channel pipetter may be used with tips on every other channel). Aspirate off 
the Poly-D-Lysine solution and rinse with 1 ml PBS (Phosphate Buffered Saline). The 
PBS should remain in the well until just prior to plating the cells, and plates may be 
poly-lysine coated in advance for up to two weeks. 
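
The dilution arithmetic in the coating step can be checked with a minimal sketch. The function names are hypothetical and the calculation assumes no pipetting dead volume; it only restates the 1:20 dilution of a 1 mg/ml stock and the 200 ul per well used on a 24 well plate. 

```python
def working_concentration_ug_per_ml(stock_mg_per_ml, dilution_factor):
    """Concentration in ug/ml after a 1:dilution_factor dilution of the stock."""
    return stock_mg_per_ml * 1000.0 / dilution_factor


def stock_volume_ul(wells, volume_per_well_ul, dilution_factor):
    """Stock volume (ul) needed to coat `wells` wells (assumes no dead volume)."""
    return wells * volume_per_well_ul / dilution_factor


# 1 mg/ml stock diluted 1:20 gives the working concentration in ug/ml.
print(working_concentration_ug_per_ml(1.0, 20))
# Stock needed for one 24 well plate at 200 ul of working solution per well.
print(stock_volume_ul(24, 200, 20))
```

In practice some excess would be prepared to cover pipetting losses; that margin is not specified in the protocol. 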

Plate 293T cells (do not carry cells past P+20) at 2 x 10^5 cells/well in 0.5 ml 
DMEM (Dulbecco's Modified Eagle Medium) (with 4.5 G/L glucose and L-glutamine 
(12-604F Biowhittaker))/10% heat inactivated FBS (14-503F Biowhittaker)/1x 
Penstrep (17-602E Biowhittaker). Let the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5 ml Optimem I (31985070 Gibco/BRL)/96-well plate. 
With a small volume multi-channel pipetter, aliquot approximately 2 ug of an expression 
vector containing a polynucleotide insert, produced by the methods described in 
Examples 8 or 9, into an appropriately labeled 96-well round bottom plate. With a 
multi-channel pipetter, add 50 ul of the Lipofectamine/Optimem I mixture to each well. 
Pipette up and down gently to mix. Incubate at RT 15-45 minutes. After about 20 
minutes, use a multi-channel pipetter to add 150 ul Optimem I to each well. As a 
control, one plate of vector DNA lacking an insert should be transfected with each set of 
transfections. 
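
As a rough check on the volumes above, the following sketch tallies how much Lipofectamine/Optimem I mixture is prepared per 96-well plate against how much is dispensed, and the resulting complex volume per well. The function names are hypothetical, and the small volume contributed by the DNA aliquot is ignored as an assumption. 

```python
def plate_mix_budget(lipofectamine_ul=300, optimem_ul=5000,
                     wells=96, mix_per_well_ul=50):
    """Compare the mixture prepared per plate with the amount dispensed."""
    prepared = lipofectamine_ul + optimem_ul
    needed = wells * mix_per_well_ul
    return {"prepared_ul": prepared,
            "needed_ul": needed,
            "excess_ul": prepared - needed}


def complex_volume_per_well(mix_per_well_ul=50, optimem_topup_ul=150):
    """Volume per well after the Optimem I top-up (DNA volume ignored)."""
    return mix_per_well_ul + optimem_topup_ul


print(plate_mix_budget())
print(complex_volume_per_well())
```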

Preferably, the transfection should be performed by tag-teaming the following 
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too 
much time on PBS. First, person A aspirates off the media from four 24-well plates of 
cells, and then person B rinses each well with 0.5-1 ml PBS. Person A then aspirates off 
the PBS rinse, and person B, using a 12-channel pipetter with tips on every other channel, 
adds the 200 ul of DNA/Lipofectamine/Optimem I complex to the odd wells first, then to 
the even wells, to each row on the 24-well plates. Incubate at 37°C for 6 hours. 

While cells are incubating, prepare appropriate media, either 1% BSA in DMEM 
with 1x penstrep, or CHO-5 media (116.6 mg/L of CaCl2 (anhyd); 0.00130 mg/L 
CuSO4-5H2O; 0.050 mg/L of Fe(NO3)3-9H2O; 0.417 mg/L of FeSO4-7H2O; 311.80 
mg/L of KCl; 28.64 mg/L of MgCl2; 48.84 mg/L of MgSO4; 6995.50 mg/L of NaCl; 
2400.0 mg/L of NaHCO3; 62.50 mg/L of NaH2PO4-H2O; 71.02 mg/L of Na2HPO4; 
0.4320 mg/L of ZnSO4-7H2O; 0.002 mg/L of Arachidonic Acid; 1.022 mg/L of 
Cholesterol; 0.070 mg/L of DL-alpha-Tocopherol-Acetate; 0.0520 mg/L of Linoleic 
Acid; 0.010 mg/L of Linolenic Acid; 0.010 mg/L of Myristic Acid; 0.010 mg/L of Oleic 
Acid; 0.010 mg/L of Palmitric Acid; 0.010 mg/L of Palmitic Acid; 100 mg/L of 
Pluronic F-68; 0.010 mg/L of Stearic Acid; 2.20 mg/L of Tween 80; 4551 mg/L of 
D-Glucose; 130.85 mg/ml of L-Alanine; 147.50 mg/ml of L-Arginine-HCL; 7.50 mg/ml 
of L-Asparagine-H2O; 6.65 mg/ml of L-Aspartic Acid; 29.56 mg/ml of 
L-Cystine-2HCL-H2O; 31.29 mg/ml of L-Cystine-2HCL; 7.35 mg/ml of L-Glutamic Acid; 
365.0 mg/ml of L-Glutamine; 18.75 mg/ml of Glycine; 52.48 mg/ml of 
L-Histidine-HCL-H2O; 106.97 mg/ml of L-Isoleucine; 111.45 mg/ml of L-Leucine; 
163.75 mg/ml of L-Lysine HCL; 32.34 mg/ml of L-Methionine; 68.48 mg/ml of 
L-Phenylalanine; 40.0 mg/ml of L-Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml of 
L-Threonine; 19.22 mg/ml of L-Tryptophan; 91.79 mg/ml of L-Tyrosine-2Na-2H2O; 
99.65 mg/ml of L-Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 
11.78 mg/L of Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 
3.02 mg/L of Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 
0.319 mg/L of Riboflavin; 3.17 mg/L of Thiamine HCL; 0.365 mg/L of Thymidine; and 
0.680 mg/L of Vitamin B12; 25 mM of HEPES Buffer; 2.39 mg/L of Na Hypoxanthine; 
0.105 mg/L of Lipoic Acid; 0.081 mg/L of Sodium Putrescine-2HCL; 55.0 mg/L of 
Sodium Pyruvate; 0.0067 mg/L of Sodium Selenite; 20 uM of Ethanolamine; 0.122 
mg/L of Ferric Citrate; 41.70 mg/L of Methyl-B-Cyclodextrin complexed with Linoleic 
Acid; 33.33 mg/L of Methyl-B-Cyclodextrin complexed with Oleic Acid; and 10 mg/L 
of Methyl-B-Cyclodextrin complexed with Retinal) with 2 mM glutamine and 1x 
penstrep. (BSA (81-068-3 Bayer) 100 gm dissolved in 1 L DMEM for a 10% BSA stock 
solution). Filter the media and collect 50 ul for endotoxin assay in a 15 ml polystyrene 
conical. 

The transfection reaction is terminated, preferably by tag-teaming, at the end of 
the incubation period. Person A aspirates off the transfection media, while person B 
adds 1.5 ml appropriate media to each well. Incubate at 37°C for 45 or 72 hours 
depending on the media used: 1% BSA for 45 hours or CHO-5 for 72 hours. 

On day four, using a 300 ul multichannel pipetter, aliquot 600 ul in one 1 ml deep 
well plate and the remaining supernatant into a 2 ml deep well. The supernatants from 
each well can then be used in the assays described in Examples 13-20. 

It is specifically understood that when activity is obtained in any of the assays 
described below using a supernatant, the activity originates from either the polypeptide 
directly (e.g., as a secreted protein) or by the polypeptide inducing expression of other 
proteins, which are then secreted into the supernatant. Thus, the invention further 
provides a method of identifying the protein in the supernatant characterized by an 
activity in a particular assay. 

Example 12: Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and proliferation 
of cells is called the Jaks-STATs pathway. Activated proteins in the Jaks-STATs 
pathway bind to gamma activation site ("GAS") elements or interferon-sensitive 
responsive element ("ISRE"), located in the promoter of many genes. The binding of a 
protein to these elements alters the expression of the associated gene. 

GAS and ISRE elements are recognized by a class of transcription factors called 
Signal Transducers and Activators of Transcription, or "STATs." There are six 
members of the STATs family. Stat1 and Stat3 are present in many cell types, as is 
Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in 
many cell types though it has been found in T helper class I cells after treatment with 
IL-12. Stat5 was originally called mammary growth factor, but has been found at 
higher concentrations in other cells including myeloid cells. It can be activated in tissue 
culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus upon 
tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") 
family. Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, 
Jak1, Jak2, and Jak3. These kinases display significant sequence similarity and are 
generally catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table 
below. (Adapted from review by Schindler and Darnell, Ann. Rev. Biochem. 64:621-51 
(1995).) A cytokine receptor family, capable of activating Jaks, is divided into two 
groups: (a) Class 1 includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, IL-9, IL-11, 
IL-12, IL-15, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and 
(b) Class 2 includes IFN-a, IFN-g, and IL-10. The Class 1 receptors share a 
conserved cysteine motif (a set of four conserved cysteines and one tryptophan) and a 
WSXWS motif (a membrane proximal region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID 
NO:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 
activate STATs, which then translocate and bind to GAS elements. This entire process 
is encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 
the GAS or the ISRE element, can be used to indicate proteins involved in the 
proliferation and differentiation of cells. For example, growth factors and cytokines are 
known to activate the Jaks-STATs pathway. (See Table below.) Thus, by using GAS 
elements linked to reporter molecules, activators of the Jaks-STATs pathway can be 
identified. 

To construct a synthetic GAS containing promoter element, which is used in the 
Biological Assays described in Examples 13-14, a PCR based strategy is employed to 
generate a GAS-SV40 promoter sequence. The 5' primer contains four tandem copies 
of the GAS binding site found in the IRF1 promoter and previously demonstrated to 
bind STATs upon induction with a range of cytokines (Rothman et al., Immunity 
1:457-468 (1994)), although other GAS or ISRE elements can be used instead. The 5' 
primer also contains 18 bp of sequence complementary to the SV40 early promoter 
sequence and is flanked with an XhoI site. The sequence of the 5' primer is: 

5':GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCG 
AAATGATTTCCCCGAAATATCTGCCATCTCAATTAG:3' (SEQ ID NO:3) 
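
The composition of the 5' primer can be illustrated with a short sketch that assembles it from its parts and counts the GAS copies. This is only an illustration under stated assumptions: the motif boundaries (taking ATTTCCCCGAAAT as one GAS copy), the short spacer bases between copies, and the leading GCGC clamp are read off the listed primer rather than defined in the text, and the variable and function names are hypothetical. 

```python
XHOI_SITE = "CTCGAG"                # XhoI recognition site
GAS_COPY = "ATTTCCCCGAAAT"          # assumed extent of one GAS binding site copy
SV40_TAIL = "ATCTGCCATCTCAATTAG"    # portion complementary to the SV40 early promoter

# Spacers ("CTAG", "G", "G") and the 5' "GCGC" clamp are taken from the listed primer.
primer = ("GCGC" + XHOI_SITE
          + GAS_COPY + "CTAG"
          + GAS_COPY + "G"
          + GAS_COPY + "G"
          + GAS_COPY
          + SV40_TAIL)


def count_copies(sequence, motif):
    """Count non-overlapping occurrences of motif in sequence."""
    return sequence.count(motif)


print(primer)
print("GAS copies:", count_copies(primer, GAS_COPY))
print("SV40-complementary length:", len(SV40_TAIL))
```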

The downstream primer is complementary to the SV40 promoter and is flanked 
with a Hind III site: 5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID 
NO:4) 

PCR amplification is performed using the SV40 promoter template present in 
the B-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is 
digested with XhoI/Hind III and subcloned into BLSK2- (Stratagene). Sequencing 
with forward and reverse primers confirms that the insert contains the following 
sequence: 

5':CTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGAAATG 
ATTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCC 
CTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGC 
CCCATGGCTGACTAATTTTTTTTATTTATGCAGAGGCCGAG 

CTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTT 
TGCAAAAAGCTT:3' (SEQ ID NO:5) 

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2 
reporter construct is next engineered. Here, the reporter molecule is a secreted alkaline 
phosphatase, or "SEAP." Clearly, however, any reporter molecule can be used instead of 
SEAP, in this or in any of the other Examples. Well known reporter molecules that can 
be used instead of SEAP include chloramphenicol acetyltransferase (CAT), luciferase, 
alkaline phosphatase, B-galactosidase, green fluorescent protein (GFP), or any protein 
detectable by an antibody. 

The above sequence confirmed synthetic GAS-SV40 promoter element is 
subcloned into the pSEAP-Promoter vector obtained from Clontech using HindIII and 
XhoI, effectively replacing the SV40 promoter with the amplified GAS:SV40 promoter 
element, to create the GAS-SEAP vector. However, this vector does not contain a 
neomycin resistance gene, and therefore, is not preferred for mammalian expression 
systems. 

Thus, in order to generate mammalian stable cell lines expressing the GAS- 
SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using 
SalI and NotI, and inserted into a backbone vector containing the neomycin resistance 
gene, such as pGFP-1 (Clontech), using these restriction sites in the multiple cloning 
site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into 
mammalian cells, this vector can then be used as a reporter molecule for GAS binding 
as described in Examples 13-14. 

Other constructs can be made using the above description and replacing GAS 
with a different promoter sequence. For example, construction of reporter molecules 
containing NF-KB and EGR promoter sequences are described in Examples 15 and 16. 
However, many other promoters can be substituted using the protocols described in 
these Examples. For instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be 
substituted, alone or in combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, 
IL-2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to test reporter 
construct activity, such as HELA (epithelial), HUVEC (endothelial), Reh (B-cell), 
Saos-2 (osteoblast), HUVAC (aortic), or Cardiomyocyte. 

Example 13: High-Throughput Screening Assay for T-cell Activity. 

The following protocol is used to assess T-cell activity by identifying factors, 
such as growth factors and cytokines, that may proliferate or differentiate T-cells. 
T-cell activity is assessed using the GAS/SEAP/Neo construct produced in Example 12. 
Thus, factors that increase SEAP activity indicate the ability to activate the Jaks-STATs 
signal transduction pathway. The T-cells used in this assay are Jurkat T-cells (ATCC 
Accession No. TIB-152), although Molt-3 cells (ATCC Accession No. CRL-1552) and 
Molt-4 cells (ATCC Accession No. CRL-1582) can also be used. 

Jurkat T-cells are lymphoblastic CD4+ Th1 helper cells. In order to generate 
stable cell lines, approximately 2 million Jurkat cells are transfected with the GAS- 
SEAP/neo vector using DMRIE-C (Life Technologies) (transfection procedure 
described below). The transfected cells are seeded to a density of approximately 
20,000 cells per well and transfectants resistant to 1 mg/ml Geneticin are selected. 
Resistant colonies are expanded and then tested for their response to increasing 
concentrations of interferon gamma. The dose response of a selected clone is 
demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells 
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to 
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in RPMI 
+ 10% serum with 1% Pen-Strep. Combine 2.5 ml of OPTI-MEM (Life Technologies) 
with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM containing 50 ul 
of DMRIE-C and incubate at room temperature for 15-45 mins. 

During the incubation period, count cell concentration, spin down the required 
number of cells (10^7 per transfection), and resuspend in OPTI-MEM to a final 
concentration of 10^7 cells/ml. Then add 1 ml of 1 x 10^7 cells in OPTI-MEM to the T25 
flask and incubate at 37°C for 6 hrs. After the incubation, add 10 ml of RPMI + 15% serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% 
serum, 1 mg/ml Geneticin, and 1% Pen-Strep. These cells are treated with supernatants 
containing a polypeptide as produced by the protocol described in Example 11. 

On the day of treatment with the supernatant, the cells should be washed and 
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The 
exact number of cells required will depend on the number of supernatants being 
screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100 
million cells) are required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the cells into 
a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 200 ul 
of cells into each well (therefore adding 100,000 cells per well). 
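
The seeding figures above follow from simple arithmetic, sketched here with hypothetical helper names. The sketch only restates the density (500,000 cells per ml), the volume per well (200 ul), and a 96 well plate; the stated figure of approximately 10 million cells per plate presumably includes a small working excess. 

```python
def cells_per_well(density_per_ml, volume_ul):
    """Cells delivered to one well at the given density and volume."""
    return density_per_ml * volume_ul // 1000


def cells_per_plate(density_per_ml, volume_ul, wells=96):
    """Cells dispensed into a full plate, with no excess for pipetting."""
    return cells_per_well(density_per_ml, volume_ul) * wells


print(cells_per_well(500_000, 200))
print(cells_per_plate(500_000, 200))
```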

After all the plates have been seeded, 50 ul of the supernatants are transferred 
directly from the 96 well plate containing the supernatants into each well using a 12 
channel pipette. In addition, a dose of exogenous interferon gamma (0.1, 1.0, 10 ng) 
is added to wells H9, H10, and H11 to serve as additional positive controls for the 
assay. 

The 96 well dishes containing Jurkat cells treated with supernatants are placed in 
an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul samples 
from each well are then transferred to an opaque 96 well plate using a 12 channel 
pipette. The opaque plates should be covered (using cellophane covers) and stored at 
-20°C until SEAP assays are performed according to Example 17. The plates 
containing the remaining treated cells are placed at 4°C and serve as a source of material 
for repeating the assay on a specific well if desired. 

As a positive control, 100 Unit/ml interferon gamma, which is known to activate 
Jurkat T cells, can be used. Over 30 fold induction is typically observed in the 
positive control wells. 

Example 14: High-Throughput Screening Assay Identifying Myeloid 
Activity 

The following protocol is used to assess myeloid activity by identifying factors, 
such as growth factors and cytokines, that may proliferate or differentiate myeloid cells. 
Myeloid cell activity is assessed using the GAS/SEAP/Neo construct produced in 
Example 12. Thus, factors that increase SEAP activity indicate the ability to activate the 
Jaks-STATs signal transduction pathway. The myeloid cell used in this assay is U937, 
a pre-monocyte cell line, although TF-1, HL60, or KG1 can be used. 

To transiently transfect U937 cells with the GAS/SEAP/Neo construct produced 
in Example 12, a DEAE-Dextran method (Kharbanda et al., 1994, Cell Growth & 
Differentiation, 5:259-265) is used. First, harvest 2x10^7 U937 cells and wash with 
PBS. The U937 cells are usually grown in RPMI 1640 medium containing 10% heat- 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 
mg/ml streptomycin. 

Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer containing 
0.5 mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM 
KCl, 375 uM Na2HPO4.7H2O, 1 mM MgCl2, and 675 uM CaCl2. Incubate at 37°C 
for 45 min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then 
resuspend in 10 ml complete medium and incubate at 37°C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 
ug/ml G418. The G418-free medium is used for routine growth but every one to two 
months, the cells should be re-grown in 400 ug/ml G418 for a couple of passages. 

These cells are tested by harvesting 1x10^8 cells (this is enough for ten 96-well 
plate assays) and washing with PBS. Suspend the cells in 200 ml of the above described 
growth medium, with a final density of 5x10^5 cells/ml. Plate 200 ul cells per well in 
the 96-well plate (or 1x10^5 cells/well). 

Add 50 ul of the supernatant prepared by the protocol described in Example 11. 
Incubate at 37°C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon gamma, 
which is known to activate U937 cells, can be used. Over 30 fold induction is typically 
observed in the positive control wells. SEAP assay the supernatant according to the 
protocol described in Example 17. 

Example 15: High-Throughput Screening Assay Identifying Neuronal 
Activity. 

When cells undergo differentiation and proliferation, a group of genes are 
activated through many different signal transduction pathways. One of these genes, 
EGR1 (early growth response gene 1), is induced in various tissues and cell types upon 
activation. The promoter of EGR1 is responsible for such induction. Using the EGR1 
promoter linked to reporter molecules, activation of cells can be assessed. 

Particularly, the following protocol is used to assess neuronal activity in PC12 
cell lines. PC12 cells (rat pheochromocytoma cells) are known to proliferate and/or 
differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl 
phorbol acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). The 
EGR1 gene expression is activated during this treatment. Thus, by stably transfecting 
PC12 cells with a construct containing an EGR promoter linked to SEAP reporter, 
activation of PC12 cells can be assessed. 

The EGR/SEAP reporter construct can be assembled by the following protocol. 
The EGR-1 promoter sequence (-633 to +1) (Sakamoto K et al., Oncogene 6:867-871 
(1991)) can be PCR amplified from human genomic DNA using the following primers: 

5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG-3' (SEQ ID NO:6) 

5' GCGAAGCTTCGCGACTCCCCGGATCCGCCTC-3' (SEQ ID NO:7) 

Using the GAS:SEAP/Neo vector produced in Example 12, the EGR1 amplified 
product can then be inserted into this vector. Linearize the GAS:SEAP/Neo vector 
using restriction enzymes XhoI/HindIII, removing the GAS/SV40 stuffer. Restrict the 
EGR1 amplified product with these same enzymes. Ligate the vector and the EGR1 
promoter. 

To prepare 96 well-plates for cell culture, two ml of a coating solution (1:30 
dilution of collagen type I (Upstate Biotech Inc. Cat#08-115) in 30% ethanol (filter 
sterilized)) is added per one 10 cm plate or 50 ml per well of the 96-well plate, and 
allowed to air dry for 2 hr. 

PC12 cells are routinely grown in RPMI-1640 medium (Bio Whittaker) 
containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat- 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 
ug/ml streptomycin on a precoated 10 cm tissue culture dish. One to four split is done 
every three to four days. Cells are removed from the plates by scraping and 
resuspended by pipetting up and down more than 15 times. 

Transfect the EGR/SEAP/Neo construct into PC12 using the Lipofectamine 
protocol described in Example 11. EGR-SEAP/PC12 stable cells are obtained by 
growing the cells in 300 ug/ml G418. The G418-free medium is used for routine 
growth but every one to two months, the cells should be re-grown in 300 ug/ml G418 
for a couple of passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS 
(Phosphate buffered saline). Then starve the cells in low serum medium (RPMI-1640 
containing 1% horse serum and 0.5% FBS with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. Scrape 
off the cells from the plate, suspend the cells well in 2 ml low serum medium. Count 
the cell number and add more low serum medium to reach a final cell density of 5x10^5 
cells/ml. 

Add 200 ul of the cell suspension to each well of a 96-well plate (equivalent to 
1x10^5 cells/well). Add 50 ul of supernatant produced by Example 11 and incubate at 
37°C for 48 to 72 hr. As a positive control, a growth factor known to activate PC12 
cells through EGR can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF). 
Over fifty-fold induction of SEAP is typically seen in the positive control wells. SEAP 
assay the supernatant according to Example 17. 

Example 16: High-Throughput Screening Assay for T-cell Activity 

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide variety 
of agents including the inflammatory cytokines IL-1 and TNF, CD30 and CD40, 
lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, and by 
expression of certain viral gene products. As a transcription factor, NF-kB regulates 
the expression of genes involved in immune cell activation, control of apoptosis (NF- 
kB appears to shield cells from apoptosis), B and T-cell development, anti-viral and 
antimicrobial responses, and multiple stress responses. 

In non-stimulated conditions, NF-kB is retained in the cytoplasm with I-kB 
(Inhibitor kB). However, upon stimulation, I-kB is phosphorylated and degraded, 
causing NF-kB to shuttle to the nucleus, thereby activating transcription of target 
genes. Target genes activated by NF-kB include IL-2, IL-6, GM-CSF, ICAM-1 and 
class 1 MHC. 

Due to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-kB promoter element are used to screen the supernatants 
produced in Example 11. Activators or inhibitors of NF-kB would be useful in treating 
diseases. For example, inhibitors of NF-kB could be used to treat those diseases 
related to the acute or chronic activation of NF-kB, such as rheumatoid arthritis. 

To construct a vector containing the NF-kB promoter element, a PCR based 
strategy is employed. The upstream primer contains four tandem copies of the NF-kB 
binding site (GGGGACTTTCCC) (SEQ ID NO:8), 18 bp of sequence complementary 
to the 5' end of the SV40 early promoter sequence, and is flanked with an XhoI site: 
5':GCGGCCTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGAC 
TTTCCATCCTGCCATCTCAATTAG:3' (SEQ ID NO:9) 

The downstream primer is complementary to the 3' end of the SV40 promoter 
and is flanked with a Hind III site: 

5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID NO:4) 

PCR amplification is performed using the SV40 promoter template present in 
the pB-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is 
digested with XhoI and Hind III and subcloned into BLSK2- (Stratagene). 
Sequencing with the T7 and T3 primers confirms the insert contains the following 
sequence: 

5':CTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGACTTTCC 
ATCTGCCATCTCAATTAGTCAGCAACCATAGTC 

TCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACT 
AATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTC 
CAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTT: 
3' (SEQ ID NO:10) 

Next, replace the SV40 minimal promoter element present in the pSEAP2- 
promoter plasmid (Clontech) with this NF-KB/SV40 fragment using XhoI and HindIII. 
However, this vector does not contain a neomycin resistance gene, and therefore, is not 
preferred for mammalian expression systems. 

In order to generate stable mammalian cell lines, the NF-kB/SV40/SEAP 
cassette is removed from the above NF-kB/SEAP vector using restriction enzymes SalI 
and NotI, and inserted into a vector containing neomycin resistance. Particularly, the 
NF-KB/SV40/SEAP cassette was inserted into pGFP-1 (Clontech), replacing the GFP 
gene, after restricting pGFP-1 with SalI and NotI. 

Once the NF-KB/SV40/SEAP/Neo vector is created, stable Jurkat T-cells are 
created and maintained according to the protocol described in Example 13. Similarly, 
the method for assaying supernatants with these stable Jurkat T-cells is also described 
in Example 13. As a positive control, exogenous TNF alpha (0.1, 1, 10 ng) is added to 
wells H9, H10, and H11, with a 5-10 fold activation typically observed. 

Example 17: Assay for SEAP Activity 

As a reporter molecule for the assays described in Examples 13-16, SEAP 
activity is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to the 
following general procedure. The Tropix Phospho-light Kit supplies the Dilution, 
Assay, and Reaction Buffers used below. 

Prime a dispenser with the 2.5 x Dilution Buffer and dispense 15 ul of 2.5 x 
dilution buffer into Optiplates containing 35 ul of a supernatant. Seal the plates with a 
plastic sealer and incubate at 65°C for 30 min. Separate the Optiplates to avoid uneven 
heating. 

Cool the samples to room temperature for 15 minutes. Empty the dispenser and 
prime with the Assay Buffer. Add 50 ul Assay Buffer and incubate at room 
temperature 5 min. Empty the dispenser and prime with the Reaction Buffer (see the 
table below). Add 50 ul Reaction Buffer and incubate at room temperature for 20 
minutes. Since the intensity of the chemiluminescent signal is time dependent, and it 
takes about 10 minutes to read 5 plates on the luminometer, one should treat 5 plates at 
a time and start the second set 10 minutes later. 

Read the relative light units in the luminometer. Set H12 as blank, and print the 
results. An increase in chemiluminescence indicates reporter activity. 
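
One way the luminometer readings might be reduced to a fold induction is sketched below. This is an assumed calculation, not a procedure stated in the protocol: the blank (well H12) is subtracted from both the sample and a vector-only control reading, and the ratio is reported. The function name and the example readings are hypothetical. 

```python
def fold_induction(sample_rlu, control_rlu, blank_rlu):
    """Blank-subtracted ratio of sample signal to control signal."""
    control_signal = control_rlu - blank_rlu
    if control_signal <= 0:
        raise ValueError("control reading must exceed the blank")
    return (sample_rlu - blank_rlu) / control_signal


# Hypothetical relative light unit readings, for illustration only.
print(fold_induction(sample_rlu=31_100, control_rlu=1_100, blank_rlu=100))
```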




Reaction Buffer Formulation: 



# of plates 


Rxn buffer diluent (rnl) 


CSPD (mi) 




60 


3 


n 


65 


3.25 


12 


70 


3.5 


13 


75 ^ - ■ 


3.75 


14 


80 . , 


4 


15 . 


35 


4.25 


16 . 


90 


4.5 


17 


95 


4.7.5^ 


18 


' 100 


5 


19 


105 


5.25 


20 


no 


5.5 


21 


115 


5.75 


22 


120 
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23 


125 


6.25 


24 


130 


6.5 


25 


135 


6.75 


26 


140 
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27 


145 


7.25 


28 


150 


7.5 


29- 


155 


7.75 


30 


160 


8 


3 1 


165 


8.25 


32 


170 


8.5 


33 


175 


8.75 


34 ■ 


180 


9 


35 


185 


9.25 


36 


190 


9.5 


37 


195 


9.75 


38 


200 


10 


39 


205 


10.25 


40 




10.5 


4 1 


215 


10.75 


42 


6.£m\J 




43 




1 1 .25 


44 




I 1 s 


45 


235 


11.75 


46 


240 


12 


47 


245 


12.25 


48 ■ ' 


250 


12.5 


49 


255 


12.75 


50 


260 


13 
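
The Reaction Buffer formulation table follows a simple linear pattern: 60 ml of diluent and 3 ml of CSPD for 10 plates, plus 5 ml of diluent and 0.25 ml of CSPD for each additional plate. As a minimal sketch, the volumes for a given plate count could be computed as shown here. The function name and the refusal to extrapolate outside 10 to 50 plates are illustrative assumptions, not part of the kit instructions.

```python
def reaction_buffer_volumes(num_plates: int) -> tuple[float, float]:
    """Return (diluent_ml, cspd_ml) for a given number of plates.

    Reproduces the linear pattern of the formulation table: 60 ml diluent
    and 3 ml CSPD for 10 plates, plus 5 ml diluent and 0.25 ml CSPD for
    each additional plate.
    """
    if not 10 <= num_plates <= 50:
        # The table only covers 10 to 50 plates; refusing to extrapolate
        # beyond it is an assumption of this sketch.
        raise ValueError("table covers 10 to 50 plates")
    extra_plates = num_plates - 10
    diluent_ml = 60.0 + 5.0 * extra_plates
    cspd_ml = 3.0 + 0.25 * extra_plates
    return diluent_ml, cspd_ml


if __name__ == "__main__":
    for n in (10, 25, 50):
        diluent, cspd = reaction_buffer_volumes(n)
        print(f"{n} plates: {diluent:g} ml diluent, {cspd:g} ml CSPD")
```

In every row the CSPD volume is one twentieth of the diluent volume, which gives a quick consistency check on any row.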



Example 18: High-Throughput Screening Assay Identifying Changes in
Small Molecule Concentration and Membrane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small
molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane
potential. These alterations can be measured in an assay to identify supernatants which
bind to receptors of a particular cell. Although the following protocol describes an
assay for calcium, this protocol can easily be modified to detect changes in potassium,
sodium, pH, membrane potential, or any other small molecule which is detectable by a
fluorescent probe.

The following assay uses a Fluorometric Imaging Plate Reader ("FLIPR") to
measure changes in fluorescent molecules (Molecular Probes) that bind small
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used
instead of the calcium fluorescent molecule, fluo-3, used here.

For adherent cells, seed the cells at 10,000-20,000 cells/well in a Co-star black
96-well plate with clear bottom. The plate is incubated in a CO2 incubator for 20 hours.
The adherent cells are washed two times in a Biotek washer with 200 ul of HBSS
(Hank's Balanced Salt Solution), leaving 100 ul of buffer after the final wash.

A stock solution of 1 mg/ml fluo-3 is made in 10% pluronic acid DMSO. To
load the cells with fluo-3, 50 ul of 12 ug/ml fluo-3 is added to each well. The plate is
incubated at 37°C in a CO2 incubator for 60 min. The plate is washed four times in the
Biotek washer with HBSS, leaving 100 ul of buffer.

For non-adherent cells, the cells are spun down from culture media. Cells are
re-suspended to 2-5x10^6 cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml
fluo-3 solution in 10% pluronic acid DMSO is added to each ml of cell suspension.
The tube is then placed in a 37°C water bath for 30-60 min. The cells are washed twice
with HBSS, resuspended to 1x10^6 cells/ml, and dispensed into a microplate, 100
ul/well. The plate is centrifuged at 1000 rpm for 5 min. The plate is then washed once
in Denley CellWash with 200 ul, followed by an aspiration step to 100 ul final volume.
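
The two loading procedures arrive at roughly the same working dye concentration, which can be checked with ordinary dilution arithmetic (C1 x V1 = C2 x V2). The following is a minimal sketch; the helper name is hypothetical and is not part of the protocol.

```python
def final_concentration(stock_conc: float, stock_vol: float, diluent_vol: float) -> float:
    """Concentration after adding stock_vol of stock to diluent_vol of buffer.

    The result has the same unit as stock_conc; both volumes must share a unit.
    """
    return stock_conc * stock_vol / (stock_vol + diluent_vol)


# Adherent cells: 50 ul of 12 ug/ml fluo-3 is added to the 100 ul left in each well.
adherent_ug_per_ml = final_concentration(12.0, 50.0, 100.0)

# Non-adherent cells: 4 ul of 1 mg/ml (1000 ug/ml) stock per 1 ml (1000 ul) of suspension.
suspension_ug_per_ml = final_concentration(1000.0, 4.0, 1000.0)

if __name__ == "__main__":
    print(f"adherent wells:  {adherent_ug_per_ml:.2f} ug/ml fluo-3")
    print(f"cell suspension: {suspension_ug_per_ml:.2f} ug/ml fluo-3")
```

Both cases work out to about 4 ug/ml of fluo-3 during loading.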

For a non-cell based assay, each well contains a fluorescent molecule, such as
fluo-3. The supernatant is added to the well, and a change in fluorescence is detected.

To measure the fluorescence of intracellular calcium, the FLIPR is set for the
following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4
second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and
(6) Sample addition is 50 ul. Increased emission at 530 nm indicates an extracellular
signaling event which has resulted in an increase in the intracellular Ca++
concentration.

Example 19: High-Throughput Screening Assay Identifying Tyrosine
Kinase Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase
(RPTK) group are receptors for a range of mitogenic and metabolic growth factors
including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor subfamilies. In
addition there is a large family of RPTKs for which the corresponding ligand is
unknown. Ligands for RPTKs include mainly secreted small proteins, but also
membrane-bound and extracellular matrix proteins.

Activation of RPTK by ligands involves ligand-mediated receptor dimerization,
resulting in transphosphorylation of the receptor subunits and activation of the
cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include receptor
associated tyrosine kinases of the src-family (e.g., src, yes, lck, lyn, fyn) and
non-receptor linked and cytosolic protein tyrosine kinases, such as the Jak family, members
of which mediate signal transduction triggered by the cytokine superfamily of receptors
(e.g., the Interleukins, Interferons, GM-CSF, and Leptin).

Because of the wide range of known factors capable of stimulating tyrosine
kinase activity, the identification of novel human secreted proteins capable of activating
tyrosine kinase signal transduction pathways is of interest. Therefore, the following
protocol is designed to identify those novel human secreted proteins capable of
activating the tyrosine kinase signal transduction pathways.

Seed target cells (e.g., primary keratinocytes) at a density of approximately
25,000 cells per well in a 96 well Loprodyne Silent Screen Plate purchased from
Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses with
100% ethanol, rinsed with water and dried overnight. Some plates are coated for 2 hr
with 100 ul of cell culture grade type I collagen (50 mg/ml), gelatin (2%) or polylysine
(50 mg/ml), all of which can be purchased from Sigma Chemicals (St. Louis, MO), or
10% Matrigel purchased from Becton Dickinson (Bedford, MA), or calf serum, rinsed
with PBS and stored at 4°C. Cell growth on these plates is assayed by seeding 5,000
cells/well in growth medium and indirect quantitation of cell number through use of
alamarBlue as described by the manufacturer Alamar Biosciences, Inc. (Sacramento,
CA) after 48 hr. Falcon plate covers #3071 from Becton Dickinson (Bedford, MA) are
used to cover the Loprodyne Silent Screen Plates. Falcon Microtest III cell culture
plates can also be used in some proliferation experiments.

To prepare extracts, A431 cells are seeded onto the nylon membranes of
Loprodyne plates (20,000/200 ul/well) and cultured overnight in complete medium.
Cells are quiesced by incubation in serum-free basal medium for 24 hr. After 5-20
minutes treatment with EGF (60 ng/ml) or 50 ul of the supernatant produced in Example
11, the medium is removed and 100 ul of extraction buffer (20 mM HEPES pH
7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3VO4, 2 mM Na4P2O7
and a cocktail of protease inhibitors (# 1836170) obtained from Boehringer Mannheim
(Indianapolis, IN)) is added to each well and the plate is shaken on a rotating shaker for
5 minutes at 4°C. The plate is then placed in a vacuum transfer manifold and the extract
filtered through the 0.45 um membrane bottoms of each well using house vacuum.
Extracts are collected in a 96-well catch/assay plate in the bottom of the vacuum
manifold and immediately placed on ice. To obtain extracts clarified by centrifugation,
the content of each well, after detergent solubilization for 5 minutes, is removed and
centrifuged for 15 minutes at 4°C at 16,000 x g.

Test the filtered extracts for levels of tyrosine kinase activity. Although many 
methods of detecting tyrosine kinase activity are known, one method is described here. 

Generally, the tyrosine kinase activity of a supernatant is evaluated by
determining its ability to phosphorylate a tyrosine residue on a specific substrate (a
biotinylated peptide). Biotinylated peptides that can be used for this purpose include
PSK1 (corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and
PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for
a range of tyrosine kinases and are available from Boehringer Mannheim.

The tyrosine kinase reaction is set up by adding the following components in
order. First, add 10 ul of 5 uM Biotinylated Peptide, then 10 ul ATP/Mg2+ (5 mM
ATP/50 mM MgCl2), then 10 ul of 5x Assay Buffer (40 mM imidazole hydrochloride,
pH 7.3, 40 mM beta-glycerophosphate, 1 mM EGTA, 100 mM MgCl2, 5 mM MnCl2,
0.5 mg/ml BSA), then 5 ul of Sodium Vanadate (1 mM), and then 5 ul of water. Mix the
components gently and preincubate the reaction mix at 30°C for 2 min. Initiate the
reaction by adding 10 ul of the control enzyme or the filtered supernatant.

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 120 mM
EDTA, and the reactions are placed on ice.
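
The component volumes in the kinase reaction add up to 50 ul, so each stock is diluted by a fixed factor, and adding the EDTA stop solution brings the total to 60 ul. The following minimal sketch works through that arithmetic. The data layout and function names are illustrative assumptions; the volumes and stock concentrations are those given in the protocol.

```python
# (name, volume in ul, stock concentration, unit), listed in order of addition.
# A stock concentration of None marks components with no stated concentration.
COMPONENTS = [
    ("biotinylated peptide", 10.0, 5.0, "uM"),
    ("ATP", 10.0, 5.0, "mM"),
    ("assay buffer", 10.0, 5.0, "x"),
    ("sodium vanadate", 5.0, 1.0, "mM"),
    ("water", 5.0, None, None),
    ("enzyme or supernatant", 10.0, None, None),
]

STOP_VOLUME_UL = 10.0   # volume of EDTA stop solution
STOP_EDTA_MM = 120.0    # EDTA stock concentration


def total_volume(components) -> float:
    """Sum of all component volumes in ul."""
    return sum(volume for _, volume, _, _ in components)


def final_concentrations(components) -> dict:
    """Map component name to (final concentration, unit) in the reaction mix."""
    total = total_volume(components)
    return {
        name: (stock * volume / total, unit)
        for name, volume, stock, unit in components
        if stock is not None
    }


def edta_after_stop(components) -> float:
    """Final EDTA concentration (mM) once the stop solution has been added."""
    return STOP_EDTA_MM * STOP_VOLUME_UL / (total_volume(components) + STOP_VOLUME_UL)


if __name__ == "__main__":
    print(f"reaction volume: {total_volume(COMPONENTS):g} ul")
    for name, (conc, unit) in final_concentrations(COMPONENTS).items():
        print(f"{name}: {conc:g} {unit}")
    print(f"EDTA after stop: {edta_after_stop(COMPONENTS):g} mM")
```

Under these numbers the 5x Assay Buffer ends up at 1x, the peptide at 1 uM, ATP at 1 mM, and EDTA at 20 mM after the stop.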

Tyrosine kinase activity is determined by transferring a 50 ul aliquot of the reaction
mixture to a microtiter plate (MTP) module and incubating at 37°C for 20 min. This
allows the streptavidin coated 96 well plate to associate with the biotinylated peptide.
Wash the MTP module with 300 ul/well of PBS four times. Next add 75 ul of
anti-phosphotyrosine antibody conjugated to horseradish peroxidase
(anti-P-Tyr-POD (0.5 u/ml)) to each well and incubate at 37°C for one hour. Wash the wells as
above.

Next add 100 ul of peroxidase substrate solution (Boehringer Mannheim) and
incubate at room temperature for at least 5 min (up to 30 min). Measure the
absorbance of the sample at 405 nm using an ELISA reader. The level of bound
peroxidase activity is quantitated using an ELISA reader and reflects the level of
tyrosine kinase activity.

Example 20: High-Throughput Screening Assay Identifying
Phosphorylation Activity 

As a potential alternative and/or complement to the assay of protein tyrosine
kinase activity described in Example 19, an assay which detects activation
(phosphorylation) of major intracellular signal transduction intermediates can also be
used. For example, as described below, one particular assay can detect tyrosine
phosphorylation of the Erk-1 and Erk-2 kinases. However, phosphorylation of other
molecules, such as Raf, JNK, p38 MAP, Map kinase kinase (MEK), MEK kinase,
Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other
phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be detected by
substituting these molecules for Erk-1 or Erk-2 in the following assay.

Specifically, assay plates are made by coating the wells of a 96-well ELISA
plate with 0.1 ml of protein G (1 ug/ml) for 2 hr at room temp. (RT). The plates are then
rinsed with PBS and blocked with 3% BSA/PBS for 1 hr at RT. The protein G plates
are then treated with 2 commercial monoclonal antibodies (100 ng/well) against Erk-1
and Erk-2 (1 hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this
step can easily be modified by substituting a monoclonal antibody detecting any of the
above described molecules.) After 3-5 rinses with PBS, the plates are stored at 4°C
until use.

A431 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and
cultured overnight in growth medium. The cells are then starved for 48 hr in basal
medium (DMEM) and then treated with EGF (6 ng/well) or 50 ul of the supernatants
obtained in Example 11 for 5-20 minutes. The cells are then solubilized and extracts
filtered directly into the assay plate.

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As a
positive control, a commercial preparation of MAP kinase (10 ng/well) is used in place
of A431 extract. Plates are then treated with a commercial polyclonal (rabbit) antibody
(1 ug/ml) which specifically recognizes the phosphorylated epitope of the Erk-1 and
Erk-2 kinases (1 hr at RT). This antibody is biotinylated by standard procedures. The
bound polyclonal antibody is then quantitated by successive incubations with
Europium-streptavidin and Europium fluorescence enhancing reagent in the Wallac
DELFIA instrument (time-resolved fluorescence). An increased fluorescent signal over
background indicates phosphorylation.

Example 21: Method of Determining Alterations in a Gene 
Corresponding to a Polynucleotide 

RNA is isolated from entire families or individual patients presenting with a
phenotype of interest (such as a disease). cDNA is then generated from
these RNA samples using protocols known in the art. (See, Sambrook.) The cDNA is
then used as a template for PCR, employing primers surrounding regions of interest in
SEQ ID NO:X. Suggested PCR conditions consist of 35 cycles at 95°C for 30
seconds; 60-120 seconds at 52-58°C; and 60-120 seconds at 70°C, using buffer
solutions described in Sidransky, D., et al., Science 252:706 (1991).
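
The suggested cycling conditions leave the annealing and extension steps as ranges, so the total hold time per run also varies. As a rough planning sketch, the bounds can be computed as shown here. Only the programmed hold times are counted; ramp times between temperatures and any initial denaturation or final extension are ignored, which is an assumption of this example.

```python
CYCLES = 35
DENATURE_S = 30          # 95 C
ANNEAL_S = (60, 120)     # 52-58 C, shortest and longest hold
EXTEND_S = (60, 120)     # 70 C, shortest and longest hold


def cycling_time_minutes(cycles: int, denature_s: int, anneal_s: int, extend_s: int) -> float:
    """Total programmed hold time in minutes, excluding ramping."""
    return cycles * (denature_s + anneal_s + extend_s) / 60


shortest_min = cycling_time_minutes(CYCLES, DENATURE_S, ANNEAL_S[0], EXTEND_S[0])
longest_min = cycling_time_minutes(CYCLES, DENATURE_S, ANNEAL_S[1], EXTEND_S[1])

if __name__ == "__main__":
    print(f"hold time per run: {shortest_min:g} to {longest_min:g} minutes")
```

With these settings the programmed holds take between 87.5 and 157.5 minutes per run.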

PCR products are then sequenced using primers labeled at their 5' end with T4
polynucleotide kinase, employing SequiTherm Polymerase (Epicentre Technologies).
The intron-exon borders of selected exons are also determined and genomic PCR
products analyzed to confirm the results. PCR products harboring suspected mutations
are then cloned and sequenced to validate the results of the direct sequencing.

PCR products are cloned into T-tailed vectors as described in Holton, T.A. and
Graham, M.W., Nucleic Acids Research, 19:1156 (1991) and sequenced with T7
polymerase (United States Biochemical). Affected individuals are identified by
mutations not present in unaffected individuals.

Genomic rearrangements are also observed as a method of determining
alterations in a gene corresponding to a polynucleotide. Genomic clones isolated
according to Example 2 are nick-translated with digoxigenin-deoxyuridine
5'-triphosphate (Boehringer Mannheim), and FISH performed as described in Johnson,
Cg. et al., Methods Cell Biol. 35:73-99 (1991). Hybridization with the labeled probe is
carried out using a vast excess of human cot-1 DNA for specific hybridization to the
corresponding genomic locus.

Chromosomes are counterstained with 4',6-diamidino-2-phenylindole and
propidium iodide, producing a combination of C- and R-bands. Aligned images for
precise mapping are obtained using a triple-band filter set (Chroma Technology,
Brattleboro, VT) in combination with a cooled charge-coupled device camera
(Photometrics, Tucson, AZ) and variable excitation wavelength filters. (Johnson, Cv.
et al., Genet. Anal. Tech. Appl., 8:75 (1991).) Image collection, analysis and
chromosomal fractional length measurements are performed using the ISee Graphical
Program System (Inovision Corporation, Durham, NC). Chromosome alterations of
the genomic region hybridized by the probe are identified as insertions, deletions, and
translocations. These alterations are used as a diagnostic marker for an associated
disease.

Example 22: Method of Detecting Abnormal Levels of a Polypeptide in a 
Biological Sample 

A polypeptide of the present invention can be detected in a biological sample, 
and if an increased or decreased level of the polypeptide is detected, this polypeptide is
a marker for a particular phenotype. Methods of detection are numerous, and thus, it is
understood that one skilled in the art can modify the following assay to fit their
particular needs. 

For example, antibody-sandwich ELISAs are used to detect polypeptides in a
sample, preferably a biological sample. Wells of a microtiter plate are coated with
specific antibodies, at a final concentration of 0.2 to 10 ug/ml. The antibodies are either
monoclonal or polyclonal and are produced by the method described in Example 10.
The wells are blocked so that non-specific binding of the polypeptide to the well is
reduced.

The coated wells are then incubated for > 2 hours at RT with a sample 
containing the polypeptide. Preferably, serial dilutions of the sample should be used to 
validate results. The plates are then washed three times with deionized or distilled water 
to remove unbound polypeptide.

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a 
concentration of 25-400 ng, is added and incubated for 2 hours at room temperature. 
The plates are again washed three times with deionized or distilled water to remove 
unbound conjugate.

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl
phosphate (NPP) substrate solution to each well and incubate 1 hour at room
temperature. Measure the reaction by a microtiter plate reader. Prepare a standard
curve, using serial dilutions of a control sample, and plot polypeptide concentration on
the X-axis (log scale) and fluorescence or absorbance on the Y-axis (linear scale).
Interpolate the concentration of the polypeptide in the sample using the standard curve.
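
Reading a concentration off a standard curve plotted with a log-scale X-axis and a linear Y-axis amounts to interpolating linearly in log10(concentration) between the two standards that bracket the sample signal. The following is a minimal sketch of that step. The function name and the example standards are hypothetical, the curve is assumed to rise monotonically, and piecewise-linear interpolation is only one of several fitting approaches that could be used.

```python
import math


def interpolate_concentration(standards, signal):
    """Estimate concentration from a signal using a semi-log standard curve.

    standards: iterable of (concentration, signal) pairs, concentrations > 0,
    with signal assumed to increase monotonically with concentration.
    Interpolation is linear in log10(concentration) versus signal.
    """
    points = sorted((sig, math.log10(conc)) for conc, sig in standards)
    for (sig_lo, log_lo), (sig_hi, log_hi) in zip(points, points[1:]):
        if sig_lo <= signal <= sig_hi:
            if sig_hi == sig_lo:
                return 10 ** log_lo
            fraction = (signal - sig_lo) / (sig_hi - sig_lo)
            return 10 ** (log_lo + fraction * (log_hi - log_lo))
    # No extrapolation: samples outside the curve should be re-diluted.
    raise ValueError("signal outside the range of the standard curve")


if __name__ == "__main__":
    # Made-up standards for illustration only: (concentration, absorbance).
    example_standards = [(1, 0.1), (10, 0.5), (100, 0.9), (1000, 1.3)]
    estimate = interpolate_concentration(example_standards, 0.7)
    print(f"estimated concentration: {estimate:.1f}")
```

A signal outside the range of the standards raises an error instead of extrapolating, which matches the advice to run serial dilutions of the sample.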

Example 23: Formulating a Polypeptide 

The secreted polypeptide composition will be formulated and dosed in a fashion
consistent with good medical practice, taking into account the clinical condition of the 
individual patient (especially the side effects of treatment with the secreted polypeptide 
alone), the site of delivery, the method of administration, the scheduling of 
administration, and other factors known to practitioners. The "effective amount" for 
purposes herein is thus determined by such considerations. 

As a general proposition, the total pharmaceutically effective amount of secreted
polypeptide administered parenterally per dose will be in the range of about 1 ug/kg/day
to 10 mg/kg/day of patient body weight, although, as noted above, this will be subject
to therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and
most preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If
given continuously, the secreted polypeptide is typically administered at a dose rate of
about 1 ug/kg/hour to about 50 ug/kg/hour, either by 1-4 injections per day or by
continuous subcutaneous infusions, for example, using a mini-pump. An intravenous
bag solution may also be employed. The length of treatment needed to observe changes
and the interval following treatment for responses to occur appear to vary depending
on the desired effect.

Pharmaceutical compositions containing the secreted protein of the invention are
administered orally, rectally, parenterally, intracisternally, intravaginally,
intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal
patch), bucally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" refers
to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or
formulation auxiliary of any type. The term "parenteral" as used herein refers to modes
of administration which include intravenous, intramuscular, intraperitoneal, intrasternal,
subcutaneous and intraarticular injection and infusion.

The secreted polypeptide is also suitably administered by sustained-release
systems. Suitable examples of sustained-release compositions include semi-permeable
polymer matrices in the form of shaped articles, e.g., films, or microcapsules.
Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 53,481),
copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al.,
Biopolymers 22:547-556 (1983)), poly (2-hydroxyethyl methacrylate) (R. Langer et
al., J. Biomed. Mater. Res. 15:167-277 (1981), and R. Langer, Chem. Tech.
12:98-105 (1982)), ethylene vinyl acetate (R. Langer et al.) or poly-D-(-)-3-hydroxybutyric
acid (EP 133,988). Sustained-release compositions also include liposomally entrapped
polypeptides. Liposomes containing the secreted polypeptide are prepared by methods
known per se: DE 3,218,121; Epstein et al., Proc. Natl. Acad. Sci. USA 82:3688-3692
(1985); Hwang et al., Proc. Natl. Acad. Sci. USA 77:4030-4034 (1980); EP 52,322;
EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese Pat. Appl. 83-118008;
U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324. Ordinarily, the liposomes
are of the small (about 200-800 Angstroms) unilamellar type in which the lipid content
is greater than about 30 mol. percent cholesterol, the selected proportion being adjusted
for the optimal secreted polypeptide therapy.

For parenteral administration, in one embodiment, the secreted polypeptide is
formulated generally by mixing it at the desired degree of purity, in a unit dosage
injectable form (solution, suspension, or emulsion), with a pharmaceutically acceptable
carrier, i.e., one that is non-toxic to recipients at the dosages and concentrations
employed and is compatible with other ingredients of the formulation. For example, the
formulation preferably does not include oxidizing agents and other compounds that are
known to be deleterious to polypeptides.

Generally, the formulations are prepared by contacting the polypeptide
uniformly and intimately with liquid carriers or finely divided solid carriers or both.
Then, if necessary, the product is shaped into the desired formulation. Preferably the
carrier is a parenteral carrier, more preferably a solution that is isotonic with the blood
of the recipient. Examples of such carrier vehicles include water, saline, Ringer's
solution, and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl
oleate are also useful herein, as well as liposomes.

The carrier suitably contains minor amounts of additives such as substances that
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at
the dosages and concentrations employed, and include buffers such as phosphate,
citrate, succinate, acetic acid, and other organic acids or their salts; antioxidants such as
ascorbic acid; low molecular weight (less than about ten residues) polypeptides, e.g.,
polyarginine or tripeptides; proteins, such as serum albumin, gelatin, or
immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids,
such as glycine, glutamic acid, aspartic acid, or arginine; monosaccharides,
disaccharides, and other carbohydrates including cellulose or its derivatives, glucose,
mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or
sorbitol; counterions such as sodium; and/or nonionic surfactants such as polysorbates,
poloxamers, or PEG.

The secreted polypeptide is typically formulated in such vehicles at a 
concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of
about 3 to 8. It will be understood that the use of certain of the foregoing excipients,
carriers, or stabilizers will result in the formation of polypeptide salts. 

Any polypeptide to be used for therapeutic administration can be sterile. 
Sterility is readily accomplished by filtration through sterile filtration membranes (e.g., 
0.2 micron membranes). Therapeutic polypeptide compositions generally are placed
into a container having a sterile access port, for example, an intravenous solution bag or
vial having a stopper pierceable by a hypodermic injection needle. 

Polypeptides ordinarily will be stored in unit or multi-dose containers, for 
example, sealed ampoules or vials, as an aqueous solution or as a lyophilized 
formulation for reconstitution. As an example of a lyophilized formulation, 10-ml vials
are filled with 5 ml of sterile-filtered 1% (w/v) aqueous polypeptide solution, and the
resulting mixture is lyophilized. The infusion solution is prepared by reconstituting the 
lyophilized polypeptide using bacteriostatic Water-for-Injection. 
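
The lyophilized formulation example can be turned into a quick content check: 1% (w/v) is 1 g per 100 ml, or 10 mg/ml, so a 5 ml fill holds 50 mg of polypeptide. The following is a minimal sketch with hypothetical helper names. The reconstitution targets of 1 and 10 mg/ml are borrowed from the preferred formulation range stated earlier and are an assumption here, since the text does not specify a reconstitution volume.

```python
def percent_wv_to_mg_per_ml(percent_wv: float) -> float:
    """Convert % (w/v) to mg/ml: 1% (w/v) is 1 g per 100 ml, i.e. 10 mg/ml."""
    return percent_wv * 10.0


def polypeptide_per_vial_mg(fill_volume_ml: float, percent_wv: float) -> float:
    """Mass of polypeptide in a vial filled with a % (w/v) solution."""
    return fill_volume_ml * percent_wv_to_mg_per_ml(percent_wv)


def reconstitution_volume_ml(content_mg: float, target_mg_per_ml: float) -> float:
    """Volume of diluent needed to reach a target concentration."""
    return content_mg / target_mg_per_ml


if __name__ == "__main__":
    content = polypeptide_per_vial_mg(5.0, 1.0)
    print(f"polypeptide per vial: {content:g} mg")
    for target in (1.0, 10.0):  # assumed targets, in mg/ml
        volume = reconstitution_volume_ml(content, target)
        print(f"{target:g} mg/ml needs {volume:g} ml of diluent")
```

Reaching 1 mg/ml would need 50 ml, more than the 10-ml vial holds, so in-vial reconstitution of this fill implies a concentration of at least 5 mg/ml.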

The invention also provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Associated with such container(s) can be a notice in the
form prescribed by a governmental agency regulating the manufacture, use or sale of
pharmaceuticals or biological products, which notice reflects approval by the agency of
manufacture, use or sale for human administration. In addition, the polypeptides of the
present invention may be employed in conjunction with other therapeutic compounds.

Example 24: Method of Treating Decreased Levels of the Polypeptide

It will be appreciated that conditions caused by a decrease in the standard or 
normal expression level of a secreted protein in an individual can be treated by 
administering the polypeptide of the present invention, preferably in the secreted form. 
Thus, the invention also provides a method of treatment of an individual in need of an
increased level of the polypeptide comprising administering to such an individual a
pharmaceutical composition comprising an amount of the polypeptide to increase the
activity level of the polypeptide in such an individual.

For example, a patient with decreased levels of a polypeptide receives a daily
dose of 0.1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the
polypeptide is in the secreted form. The exact details of the dosing scheme, based on
administration and formulation, are provided in Example 23.

Example 25: Method of Treating Increased Levels of the Polypeptide

Antisense technology is used to inhibit production of a polypeptide of the
present invention. This technology is one example of a method of decreasing levels of
a polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer.

For example, a patient diagnosed with abnormally increased levels of a
polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5,
2.0 and 3.0 mg/kg/day for 21 days. This treatment is repeated after a 7-day rest period
if the treatment was well tolerated. The formulation of the antisense polynucleotide is
provided in Example 23.

Example 26: Method of Treatment Using Gene Therapy 

One method of gene therapy transplants fibroblasts, which are capable of
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and
separated into small pieces. Small chunks of the tissue are placed on a wet surface of a
tissue culture flask; approximately ten pieces are placed in each flask. The flask is
turned upside down, closed tight and left at room temperature overnight. After 24
hours at room temperature, the flask is inverted and the chunks of tissue remain fixed to
the bottom of the flask and fresh media (e.g., Ham's F12 media, with 10% FBS,
penicillin and streptomycin) is added. The flasks are then incubated at 37°C for
approximately one week.

At this time, fresh media is added and subsequently changed every several days.
After an additional two weeks in culture, a monolayer of fibroblasts emerges. The
monolayer is trypsinized and scaled into larger flasks.

pMV-7 (Kirschmeier, P.T. et al., DNA, 7:219-25 (1988)), flanked by the long
terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and
HindIII and subsequently treated with calf intestinal phosphatase. The linear vector is
fractionated on agarose gel and purified, using glass beads.

The cDNA encoding a polypeptide of the present invention can be amplified
using PCR primers which correspond to the 5' and 3' end sequences respectively as set
forth in Example 1. Preferably, the 5' primer contains an EcoRI site and the 3' primer
includes a HindIII site. Equal quantities of the Moloney murine sarcoma virus linear
backbone and the amplified EcoRI and HindIII fragment are added together, in the
presence of T4 DNA ligase. The resulting mixture is maintained under conditions
appropriate for ligation of the two fragments. The ligation mixture is then used to
transform bacteria HB101, which are then plated onto agar containing kanamycin for
the purpose of confirming that the vector has the gene of interest properly inserted.

The amphotropic pA317 or GP+am12 packaging cells are grown in tissue
culture to confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 10%
calf serum (CS), penicillin and streptomycin. The MSV vector containing the gene is
then added to the media and the packaging cells transduced with the vector. The
packaging cells now produce infectious viral particles containing the gene (the
packaging cells are now referred to as producer cells).

Fresh media is added to the transduced producer cells, and subsequently, the
media is harvested from a 10 cm plate of confluent producer cells. The spent media,
containing the infectious viral particles, is filtered through a millipore filter to remove
detached producer cells and this media is then used to infect fibroblast cells. Media is
removed from a sub-confluent plate of fibroblasts and quickly replaced with the media
from the producer cells. This media is removed and replaced with fresh media. If the
titer of virus is high, then virtually all fibroblasts will be infected and no selection is
required. If the titer is very low, then it is necessary to use a retroviral vector that has a
selectable marker, such as neo or his. Once the fibroblasts have been efficiently
infected, the fibroblasts are analyzed to determine whether protein is produced.

The engineered fibroblasts are then transplanted onto the host, either alone or
after having been grown to confluence on cytodex 3 microcarrier beads.

Example 27: Method of Treatment Using Gene Therapy - In Vivo

Another aspect of the present invention is using in vivo gene therapy
methods to treat disorders, diseases and conditions. The gene therapy method
relates to the introduction of naked nucleic acid (DNA, RNA, and antisense
DNA or RNA) sequences into an animal to increase or decrease the expression
of the polypeptide of the present invention. A polynucleotide of the present
invention may be operatively linked to a promoter or any other genetic elements
necessary for the expression of the encoded polypeptide by the target tissue.
Such gene therapy and delivery techniques and methods are known in the art,
see, for example, WO90/11092, WO98/11779; U.S. Patent No. 5693622,
5705151, 5530859; Tabata H. et al. (1997) Cardiovasc. Res. 35(3):470-479,
Chao J. et al. (1997) Pharmacol. Res. 35(6):517-522, Wolff J.A. (1997)
Neuromuscul. Disord. 7(5):314-318, Schwartz B. et al. (1996) Gene Ther.
3(5):405-411, Tsurumi Y. et al. (1996) Circulation 94(12):3281-3290
(incorporated herein by reference).

The polynucleotide constructs of the present invention may be delivered
by any method that delivers injectable materials to the cells of an animal, such
as injection into the interstitial space of tissues (heart, muscle, skin, lung, liver,
intestine and the like). These polynucleotide constructs can be delivered in a
pharmaceutically acceptable liquid or aqueous carrier.

The term "naked" polynucleotide, DNA or RNA, refers to sequences
that are free from any delivery vehicle that acts to assist, promote, or facilitate
entry into the cell, including viral sequences, viral particles, liposome
formulations, lipofectin or precipitating agents and the like. However, the
polynucleotides may also be delivered in liposome formulations (such as those
taught in Felgner P.L. et al. (1995) Ann. NY Acad. Sci. 772:126-139 and
Abdallah B. et al. (1995) Biol. Cell 85(1):1-7) which can be prepared by
methods well known to those skilled in the art.

The polynucleotide vector constructs of the present invention used in 
the gene therapy method are preferably constructs that will not integrate into the 
host genome nor will they contain sequences that allow for replication. Any 
strong promoter known to those skilled in the art can be used for driving the 
expression of DNA. Unlike other gene therapy techniques, one major 
advantage of introducing naked nucleic acid sequences into target cells is the 
transitory nature of the polynucleotide synthesis in the cells. Studies have 
shown that non-replicating DNA sequences can be introduced into cells to 
provide production of the desired polypeptide for periods of up to six months. 

The polynucleotide construct of the present invention can be delivered to 
the interstitial space of tissues within an animal, including muscle, skin, 
brain, lung, liver, spleen, bone marrow, thymus, heart, lymph, blood, bone, 
cartilage, pancreas, kidney, gall bladder, stomach, intestine, testis, ovary, 
uterus, rectum, nervous system, eye, gland, and connective tissue. Interstitial 
space of the tissues comprises the intercellular fluid, mucopolysaccharide matrix 
among the reticular fibers of organ tissues, elastic fibers in the walls of vessels or 
chambers, collagen fibers of fibrous tissues, or that same matrix within 
connective tissue ensheathing muscle cells or in the lacunae of bone. It is 
similarly the space occupied by the plasma of the circulation and the lymph fluid 
of the lymphatic channels. Delivery to the interstitial space of muscle tissue is 
preferred for the reasons discussed below. The constructs may be conveniently delivered 
by injection into the tissues comprising these cells. They are preferably delivered 
to and expressed in persistent, non-dividing cells which are differentiated, 
although delivery and expression may be achieved in non-differentiated or less 
completely differentiated cells, such as, for example, stem cells of blood or skin 
fibroblasts. In vivo muscle cells are particularly competent in their ability to take 
up and express polynucleotides. 

For the naked polynucleotide injection, an effective dosage amount of 
DNA or RNA will be in the range of from about 0.05 µg/kg body weight to about 
50 mg/kg body weight. Preferably the dosage will be from about 0.005 mg/kg 
to about 20 mg/kg and more preferably from about 0.05 mg/kg to about 5 mg/kg. 
Of course, as the artisan of ordinary skill will appreciate, this dosage will vary 
according to the tissue site of injection. The appropriate and effective dosage of 
nucleic acid sequence can readily be determined by those of ordinary skill in the 
art and may depend on the condition being treated and the route of 
administration. The preferred route of administration is by the parenteral route of 
injection into the interstitial space of tissues. However, other parenteral routes 
may also be used, such as inhalation of an aerosol formulation, particularly for 
delivery to lungs or bronchial tissues, throat or mucous membranes of the nose. 
In addition, naked polynucleotide constructs can be delivered to arteries during 
angioplasty by the catheter used in the procedure. 
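
As a rough illustration of the weight-based ranges stated above, the following Python sketch converts a per-kilogram dose into an absolute amount for a given body weight. Only the preferred range (about 0.005 to about 20 mg/kg) and the more preferred range (about 0.05 to about 5 mg/kg) are taken from the text; the function name `dose_range_mg`, the dictionary labels, and the 25 g example body weight are hypothetical and are introduced here for illustration only.

```python
# Illustrative sketch only. The names below and the example body weight
# are assumptions; the numeric ranges are the preferred and more
# preferred mg/kg ranges stated in the paragraph above.
RANGES_MG_PER_KG = {
    "preferred": (0.005, 20.0),
    "more_preferred": (0.05, 5.0),
}


def dose_range_mg(body_weight_kg, label="more_preferred"):
    """Return the (low, high) absolute dose in mg for a body weight in kg."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = RANGES_MG_PER_KG[label]
    return (low * body_weight_kg, high * body_weight_kg)


# Hypothetical example: a 25 g (0.025 kg) animal, result shown in micrograms.
low_mg, high_mg = dose_range_mg(0.025)
print(f"{low_mg * 1000:.2f} to {high_mg * 1000:.2f} micrograms")
```

The sketch only scales the stated ranges; as the text notes, the appropriate dosage also depends on the tissue site, the condition being treated, and the route of administration.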

The dose response effects of injected polynucleotide in muscle in vivo are 
determined as follows. Suitable template DNA for production of mRNA coding 
for the polypeptide of the present invention is prepared in accordance with a 
standard recombinant DNA methodology. The template DNA, which may be 
either circular or linear, is either used as naked DNA or complexed with 
liposomes. The quadriceps muscles of mice are then injected with various 
amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by 
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made 
on the anterior thigh, and the quadriceps muscle is directly visualized. The 
template DNA is injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge 
needle over one minute, approximately 0.5 cm from the distal insertion site of the 
muscle into the knee and about 0.2 cm deep. A suture is placed over the 
injection site for future localization, and the skin is closed with stainless steel 
clips. 

After an appropriate incubation time (e.g., 7 days) muscle extracts are prepared 
by excising the entire quadriceps. Every fifth 15 µm cross-section of the individual 
quadriceps muscles is histochemically stained for protein expression. A time course for 
protein expression may be done in a similar fashion except that quadriceps from 
different mice are harvested at different times. Persistence of DNA in muscle following 
injection may be determined by Southern blot analysis after preparing total cellular DNA 
and HIRT supernatants from injected and control mice. The results of the above 
experimentation in mice can be used to extrapolate proper dosages and other treatment 
parameters in humans and other animals using naked DNA of the present invention. 

It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. Numerous modifications and 
variations of the present invention are possible in light of the above teachings and, 
therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other 
disclosures) in the Background of the Invention, Detailed Description, and Examples is 
hereby incorporated herein by reference. 

Sequence Listing 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Human Genome Sciences, Inc., et al. 

(ii) TITLE OF INVENTION: 207 Human Secreted Proteins 

(iii) NUMBER OF SEQUENCES: 300 

(iv) CORRESPONDENCE ADDRESS: 
(A) ADDRESSEE: Human Genome Sciences, Inc. 
(B) STREET: 9410 Key West Avenue 
(C) CITY: Rockville 
(D) STATE: Maryland 
(E) COUNTRY: USA 
(F) ZIP: 20850 

(v) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 
(B) COMPUTER: HP Vectra 486/33 
(C) OPERATING SYSTEM: MSDOS version 6.2 
(D) SOFTWARE: ASCII Text 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
(B) FILING DATE: 
(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 
(A) APPLICATION NUMBER: 
(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Kenley K. Hoover 
(B) REGISTRATION NUMBER: 40,302 
(C) REFERENCE/DOCKET NUMBER: P2QQ7PCT 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (301) 309-8504 
(B) TELEFAX: (301) 309-3439 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 733 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GGGATCCGGA GCCCAAATCT TCTGACAAAA CTCACACATG CCC^CCQ^GC CCACCACCTG 
AATTCGAGGG TGCACCGTCA GTCTTCCTCT TCCCCCCAAA ACCCAAGGAC ACCCTCATGA 
TCTCCCGGAC TCCTGAGGTC ACATGCGTGG TGGTGGACGT AAGCCACGAA GACCCTGAGG 
TCAAGTTCAA CTGGTACGTG GACGGCGTGG AGGTGCATAA TGCCAAGACA "aAGCCGCGGG 
AGGAGCAGTA CAACAGCACG TACCGTGTGG TCAGCGTCCT CACCGTCCTG CACCAGGACT 
GGCTGAATGG CAAGGAGTAC AAGTGCAAGG TCTCCAACAA A3CCCTCCCA ACCCCCATCG 
AGAAAACCAT^CTCCAAAGCC AAAGGGCAGC CCCGAGAACC ACAGGTGTAC ACCCTGCCCC 
CATCCCGGGA TGAGCTGACC AAGAACCAGG TCAGCCTGAC CTGCCTGGTC AAAGGCTTCT 
ATCCAAGCGA CATCGCCGTG GAGTGGGAGA GCAATGGGCA GCCGGAGAAC AACTACAAGA 
CCACGCCTCC CGTGCTGGAC TCCGACGGCT CCTTCTTCCT CTACAGCAAG CTCACCGTGG 
ACAAGAGCAG GTGGCAGCAG GGGAACGTCT TCTCATGCTC CGTGATGCAT GAGGCTCTGC 
AOiACCACTA CACGCAGAAG AGCCTCTCCC TGTCTCCGGG TAAA-TGAGTG' CGACGGCCGC 
GACTCTAGAG GAT • 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Trp Ser Xaa Trp Ser 
1               5 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 86 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCGCCTCGAG ATTTCCCCGA AATCTAGATT TCCCCGAAAT GATTTCCCCG AAATGATTTC 
CCCGAAATAT CTGCCATCTC AATTAG 
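
Because the listing is printed in space-separated blocks of ten bases, sometimes followed by a running position count, a declared LENGTH can be checked mechanically against the printed sequence. The following Python sketch does this for SEQ ID NO: 3 as transcribed from the listing above. The helper name `clean_sequence` and the choice of the 10-base motif `TTCCCCGAAA` for the repeat count are assumptions made for illustration; they are not part of the listing.

```python
import re


def clean_sequence(raw):
    """Keep letters only: drops whitespace, position counters, punctuation."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


# SEQ ID NO: 3 as printed in the listing (blocks of ten bases per group).
SEQ_ID_NO_3 = """
GCGCCTCGAG ATTTCCCCGA AATCTAGATT TCCCCGAAAT GATTTCCCCG AAATGATTTC
CCCGAAATAT CTGCCATCTC AATTAG
"""

seq = clean_sequence(SEQ_ID_NO_3)
print(len(seq))                 # number of bases after cleaning: 86
print(seq.count("TTCCCCGAAA"))  # copies of the repeated motif: 4
```

The same check can be applied to any other entry in the listing; a mismatch between the cleaned length and the declared LENGTH usually points to a transcription problem in either the header or the sequence rows.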



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCGGCAAGCT TTTTGCAAAG CCTAGGC 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 271 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CTCGAGATTT CCCCGAAATC TAGATTTCCC CGAAATGATT TCCCCGAAAT GATTTCCCCG 
AAATATCTGC CATCTCAATT AGTCAGCAAC CATAGTCCCC CCCCTAACTC CGCCCATCCC 
GCCCCTAACT CCGCCCAGTT CCGCCCATTC TCCGCCCCAT GGCTGACTAA TTTTTTTTAT 
TTATGCAGAG GCCGAGGCCG CCTCGGCCTC TGAGCTATTC CAGAAGTAGT GAGGAGGCTT 
TTTTGGAGGC CTAGGCTTTT GCAAAAAGCT T 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCGCTCGAGG GATGACAGCG ATAGAACCCC GG 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCGAAGCTTC GCGACTCCCC GGATCCGCCT C 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGGGACTTTC CC 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 73 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GCGGCCTCGA GGGGACTTTC CCGGGGACTT TCCGGGGACT TTCCGGGAGT TTCCATCCTG 
CCATCTGAAT TAG 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 256 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTCGAGGGGA CTTTCCCGGG GACTTTCCGG GGACTTTCCG GGACTTTCCA TCTGCCATCT 

CAATTAGTCA GCAACCATAG TCCCGCCCCT AACTCCGCCC ATCCCGCCCC TAACTCCGCC 

CAGTTCGGCC CATTCTCCGC CCCATGGCTG ACTAATTTTT TTTATTTATG CAGAGGCCGA 

GGCCGCCTCG GCCTCTGAGC TATTCCAGAA GTAGTGAGGA GGCTTTTTTG GAGGCCTAGG 

CTTTTGCAAA AAGCTT 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2526 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GACAGGCTAT CCGAGAATCT GAGAGCTGGG CCCGGCAATT CCTCCAGYTA CCCTTGTGAC 
CTAAGTCCAG TCACACATTT CCCAAAGTTT CTCTTTGTCA TAACCCTGGT CTGGCTGGTT 
TTGRGGRCTT GAGAATGGGT CAGGGACTCC AGGCCAAGTC CAACAGAGAC CCCAAACCCA 
CCACACACCA GCAGCCACAA CCTCACCACC AAGAAAGAGG ACTTTTGTGG GGCCACAAGT 
AAGAGGTCAT TTCTGGAATG GACTCAGACC TTTAAACAGG AGAGTTGAGC ACTTCCAGK3 
AGTTTTTAAG CAAGGCATGG GGAACAGGGA ATAGAACCTT TCAAAGAGGT TGCC CAGAGA 
AAAGCTGGGC CTCTTGCATT CGGCTTCCTT GGAGCAGCCT CTTCTGGCAG AAAGCCATCA 
GGTGCTCAAT CATCTTCTCC TGGCCAAGGC TCTGACCATG CTTAGTACTG GAATAGAGGT 
GGCCAGGCCC CCAGCGACTC TTCTTGGCCT GATGTTTGTC CTCACAGGCA TGCCACGTGG 540 

CCTGAGATGA TTCAGAACAA ATCATGCTAA. CTTTGAATCC ATCCAGCCAC TTGCAAATGA 600 

TAATCAGAAG TCAGCTTGTT CACTGTTAGA AAGAAACTAA CAAAAGACAA CCCAGAGCAA 66Q 

TGTAGAATCT TTGAGTGCTT GGCTTTCCAA GGATACTGCG GAGAGTCTGG CCAAGCTGAT 720 

GAMCTTCTGA ARTGTCACTG GCACC ATATG CAACAAGAAC CACCATTCAC TGAGTAGCTA 730 

ATGGGTTTGG GGCCTGGGAC ATTCCATCTG AGGTCCTTCC TGAACATGTC ACTCCACAGC 840 

AGAGGACGGG TTGCAGCTTA GCCAGAACCA CTCGTCCA.GG AGAGCTGGAT GTTTTGCGTG 9.00 

GAACA.CCTTG AGCACTGACT GCTATTGTTC AAAAAAAGCC TTTGCTGCAT TCGGAGGACT 960 

GCCCCGTGCC CTGAGGTGAC TTCCTAACTA TGTGGTTTCA TTAGCGAATT • TATTTTTTGT 1020- 

GCTGGGTGGA CATTTGTATT TTGTTAGGTT GCTGTTTAAG CTCAAGTTTG CTGTGCTCTG 1080 

TGCAGCTACA AAACATCTTG GCATATTTAA GAKTGGCTTT TATAAATAGC TTTATTGTGA 1140. 

TATTAATCAG ATTCCCAACT TTACTGAGAA TTAAGGA.CTG GGGTACTTTA AAGAAATGCA. 1200 

AATAGCAA.TT GAAGAACCAC TGCTGCAGGT -GGTAGCCGTG GCTAGACTGA ATTACACTAG 12 SO 

AAATCAGCCA GAAGGAAGCG TCCTTGGGAT CGCAGATCAC TCTTTTTTTT TTTTTTTTTA 1320" 

AAAGGGGCAG CCCCTTGATG GCTCATCTCT CTGAATAACA GTTACGTCTT CATATCGATA 1330 

■ CCAGATGCCT TCTTCATCAT GCCACTGAAG CCACTCACCA CCTTCAAGAA CATGCCAACC 1440 

TCTGTCAGAT TCACTTACCC ACAAAC AAGG - AGGCACGTTT GGCACAAAGT GTTGTCCTCG 1500 

AGGTCCAAGT GGACTCTACA GAGTGCTTGA CCTCAACACA CTGGATTC CA GGTGGACTGG 15 60 

ACCAAGAGCA GGCAAAGACA CGGGAACTGA AAAACTCCAC ' AGGGTTTGGA GAA.TAGAAAT 1620 

GAAAAGCCAC GTCAXATAAC TCAAGAATAA ATGGTGTTTT GGAAATTTTA AAATTATCAT 163 0 

CGAAGGTGGT GAAACTATTT CAGGCCCAAA TGAAAGGAAA TCGCCAGTTG GGGATGAAAT 1740 

CACAGAGCCT GTGTTTTATG AT ATGGTTGG • ATGTCCACTG ATGAAATTTT AAAGGAGTTT 1300 

CATTTTTAAA AGTGCGCATG ATTCT ACAT A TGAGAATTCT TTA.GGCCAAG AAACTGTCCT 1360 
.TGGCTCAGAG GTGTTGGGAA TTAAAGCAGA GAGAAGCCAT TCGTGATGCT TAGAACCAAG .' .1920 

50 . GA.TGGTCATG TACACAAAGA CCATCGAGAC GGCCATTCTT GTTTACAAAA CACTTACCAA 1980 

GAAAGCACTT TGTAGGGGAA CTTTAGTAAG TTCTTCTCAT TTCATTATGT TTCTTCCAAG 2040 

GAAACAGGAG AGACTGAATT AATAATTCTC TCTTTGCTGT TAAGCACTTT TAAAATAATA 2100 

AAGTACATGT TGAAATTTGG GGGGGCATCT CTGA.TTTAAA AAAAGAAAAA GGCTGCTTGA 2160 

TGTATGTTAT GCAGAGACAC TCTGCCTCTG GTGGCTGCAG AGGAATACCC AAGCCTCATT 2.220 

TGGAAGGCTC AACATTTGGA ATTGCACTTT AATTGATTAA TCCTCAATTC ATGTGGCCTT 2280 

ACGGGATGGT GGGTCTGGGA CCCCAATTCA TTCTTATCTG CCAAAGAATT ATCTAGAAGC 2340 

ACATCAAATA CCAGCACCCC ACCTGCACAA TGGGGGTGGA AAACTTTTGT ATCCCTAACC 2400 

ATATTATTTT ATAGTGTCTG CCATCCCATG TGGAAATACT TTATTTTTAA CCTCAGGATT 2460 

TAAATAAAGT AAACACTATG ACATTTAAAA AAAAAAAAAA AAAACTCGAG GGGGGCCCGG 2520 

TACCCA 2526 
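
Several sequences in this listing contain letters other than A, C, G and T (for example Y, R, K, M, W, S and N). In the standard IUPAC nucleotide code these letters denote positions where more than one base is possible. The following Python sketch expands a short fragment into every unambiguous sequence it is consistent with. The function name `expand` is hypothetical, and the fragment `CCAGYTA` is taken from the first row of SEQ ID NO: 11 purely as an example; whether a given non-ACGT character in this transcription is a genuine ambiguity code or a transcription artifact has to be judged case by case.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def expand(fragment):
    """List every unambiguous sequence consistent with an IUPAC-coded fragment."""
    choices = [IUPAC[base] for base in fragment.upper()]
    return ["".join(combo) for combo in product(*choices)]


print(expand("CCAGYTA"))  # ['CCAGCTA', 'CCAGTTA']
```

The number of expansions grows multiplicatively with each ambiguous position, so this approach is only practical for short fragments such as primer or probe regions.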

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1131 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

25 O.CTGCACCA GCTTTGTTAT CTGTAAAATG ATGATAATAC CAACACCTTC TTCTTGGGGT SO 

ACTGAAGATG AGAGAACATG ATATGTGTAA AGTGCCTTCC ACAATACCCA GAACA.TAGCA 120 

A-.CATGTAAT GAATGTAGTA ATAGTAATTA TTTTATTTTC TTTTGATTCA GTTGGGACTA 130 

30 

TGTTCAGCTG TAACAGAATA CCCAAAATAA CTGTTTTAAA CAAATTAAAG TTTWGTTGTG 240 

AAGTTTTGTT ACGAATTCAG ACAATCCAGG GCTTTT AT AG ATGCACCAGG ATCAGCAGGT 300 

35 . ACAAAGGCA.T CTTTCCTGAT TTCTGCCAGT CTCAATGCAT GGGTTGCAAT CCAGARTCCA 360 

RGATGGCAGT TCCAGCCCTG GTTACGCCCA TATTAGCACA O^GAAAGAAA GAGAAAGGGA 420 

TGTGCCTCTT CACTTTAATC ATAGCTCCCA CTAGATGCAC CCACTACTTC .TGCTGATACT ' 430 

40 

" CCATTAGCTA ATGCTTGCTT ACATGGTCAC ACTTAGTTTC C^GAGAGACA TGTCTGGACA . 540 

GTCATGTGCT CAATTAATAT CCAAGTGTCC AATTACTGAG "AAAAAAAGAA ACTAGCACCT 600 

45 TTGCTTGGTT GCATTCCTCT TAGCATAAGC CACATTCTTT TTATGAAGTT GTCCTCAGTT 560 

' ACTTGGATGC CTCAGTTGTC CTTTCAWTTA GAAAWGCYCC TKGGACA.YCC TGAAWCTGAC 720 

TTCTTTTGTC ATCAGCACCA TCACTACCAC TGCCYTCTTC AAAGCCACCA CGTTCTGTCC ■ 730- 

50 ' 

CCAGGATGGT TGCAACAACC ACCATAGGGA CTTTTTGCCT ~ TCTACTTCCA CACAATAGNC 340 

CAGAGTAAGC TTTTGAAAAT GTAGGTCAGA TCATGTCTCT CTCTTCCTCT TCAAAACCCT 900 

55 CCCGATGGCT TTTCATATTA CTCAAAAGAA AACCTAAAAC TTTGCTGTGA GATCTATGTG 960 

ACCCGGCTTA TTCTTCCTCT TACTTTATCT CTGTATTGCT CTTCCTCACT CTACTCCAGC 1020 

CATCCCACCT CCTTGCTGCT TGTCCTATAC TCCTAAAAGA AGTTCAGTCT TCCCTTATGA 1080 

TATTTGCACT TAAAATAGAA AAAAAAAAAA AAAAAAAACT CGAGOSGGGC C 1131 



5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 941 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

. GGCACGAGTA GCATTTCATT TAATCTGCAG GTATATTCTC CCAACAGTTT ATTGTCATGT 60 

GATGTQCTCA GCCAAGATTG TRAGGCAGAG AGGAGCTGTC CCAACCTACT ATACCACCGA 120 

20 GGCTGGAGAG ATCATATTTT TGGTATTAAA CTGGAGTCTC TCCATCCTTC ACATTGTTGA 130 ' 

TGTCCTCTGT AGCAAACCGG AAAAGTCAGT GACAGAAGAT GCCGCTASCG GTTTGACCCA 240 

GAGAATGACA GCTCTGGTTT GGAGAAAAGG GCCGGATGGT GGCTCTAGAA AGCCCATCCT 300 

25 . . 

TCTGCTCTTC T TTTT T CTCC CCCTTATATT GTGCTTTCAT TCATTCATTC ATTCATCAAA 360 

CATTTGTTGA GCACCTATTA TGTGTCAAGC TCTGTGCTAG CCTCTGGAAA ACCTGCCCTC 420 

30 " ATGTAGCTCA CTGTGGAGTA GGAGAAACAA TGACTACACT ATGATAAGCA CGGGTTGTCA '480 

GGGTCTCACA GAGCAGTGGC CCCTCATCCA GACCGATGAG GTCAAAGAAG GCATCCAGGC 540 ' 

GAGGATGGTG TCAGAGCTAA CTGAAGAATG AGAGGGAGCT GCACCASCAG GGGTTGGAAC 600 

35 

TGAAGGTGGC AGTGCCTGGA GTCTTGATTC C^GCAGAGGG AGAGCAGTCT GTGAAAAGGC '660 

ACCAAGGGTG GGAGAGGGCA GAGCACATGG AGGAACTTCA GGTAGTTCTG GATGGCSCTG 720 

40- GGGCAAAGCT AGAGAGGTAA GAAGAATCTA CAAATGTTCC TCGAGTTACA TGAACTTCCA 780 

TCCCAATAAA CCCATTGGAA ACGAAAAATT TAAGTCAGAA GTGCATTTAA GGCTGGTCCG 840 

AGTAGAATGA TTTTTACAAC GAATTGATCA CAACCAGTTA CAGATGTCTT TGTTCCTTCT 900 

45 

CCACTCCCAC TGCTTCACCT GACTAGCCTT TAAAAAAAAA A 941 



50 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 843 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CNAGGGATAA CCCCAAAGNT GGGAAATAAA CCCTCAATTA AflGGGGGAftC CAAAAAGCTG 
GGAAGTTCCC CCCCGCGGTG GCGGCCNGMT CTAGGAACTA GTGGAATCCC CCGGGGCTGC 
AGGGAATTCG GCACC-GAGTG GGAATGTTGT TTGTATGATA CTATTTCCAC AAWATGCATT 
GAGACTTGGT XTGTGGCCTA CGACATGGTC AATTCTTTYT AAATATTCCG TGAATTTCTT 
TAGTGCATAT TCTCCGATGG GGGCTGTGGG GACAGAGTTC TAAATATGGC CATTAGATTA 
AATCTCTTCA TTCTGTTGCT CACATCTTCT ATATCCTTAT TAATGTGTCA ATCTCTTCAA 
GAGAGGTGTT ATTAAAATCT CTGCTGTAT GTGTCACTTT GCCCTTAAAA TTCTGATGAT 
TTGCTTTATA AATGGTTATA ACCATTTTCC AGGAAGAACA . TTAAAGAACT TTCCATTGGC 
ATTATCCAGT TTCCCTCAAA ATACTGGTTT TTTTTATTTT GGCTNCTAAG CAGCTATGAA 
TCCAGTTTCT CAGAAGCCCT TGTCTCAAGG CATTTGTTTC CAGATTACCT TGTTAGCATC 
CACACTATGG GCTATTTTAG AAAAACAAAA "AAAGTATCAA AATCATATAG CTATGATTTT 
CCTGTGCTTG AAGGAGCCTT AAAGGTCATC TAGTGCAGGG AGTATTTGTT CATCCAAATT 
CTGCCAAGAA ATCTCTATTG TCAAGATA.TT CTTTACCATC TTTGGGACAT TCT CATT ATT 
AGAAACAAAT CCTAAGAAGA AATTCTGCCA TAKACAACCC ATCCGTTCTT TAAAAAAAAA 
AAA 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1018 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTGTAATTTT TAATTTTCAT ATACCGTGGT TTGATTCTAA TTTTATTTTT TGAGTTCTCT 
GAAGGTTACA TATACAGAGT GCTTCAGGAA TGATCATTTT GTTATTATTC ATGCTTCTTA 
AOATGTTGT TTTAGTCCAA GAAGATAATT .GCCAGAGAAA GAATACAGTG ■ CAGGAAAGAA 
GARGGTGGAG CCAGTGGTGA AGARGGATTG AGAPGACAGA CATTGTGGGA ATGAAATCAT 
GAATAATCGT GTTTTTGAAT TGTCCAAAAA CTTCTACAAA CCATGAAATG TTGGAGTTTA 
AATCTAATTG TTGAAAAATT CCCCACATTC CTTGTATCCC TTAGGTTGAG CATAATTCCA 
CATCCGTGGA CTGATGCACT TCCCAAGAGG GGGCCTCATT AACTCTTCCG AGGCAGCAGC 
AGCAAGGGCA CCCCCTCCTT TCCCCCCACA CCC-CA^TTCT CATGGCTCTT CTTTCTCTCA 
TCTCATGCTT ACGTTAGAAA AGGGCACAAG GTAAGGAAGC CCTTGGGAAT AGGCTGAATC 
TGGCTATCTA ATTTGGTGCC AAATACTTAA TGTGCTTGAA TTTAAAAACA GCAAACATGT 600 

AGAAAGGTAA TTATAATTAT GAGGCCAGTT CTTTAAGCTA GCTTTTTTTC CCCTCTCAAA 660 

" CAGCATATTG GCTTGGATGT CAGCAGGAGA AAGTGTTTTT TGCAATACAC ATAATGCATA 720 

TATGGTCCTG TTAGGAATCT ATAGAAAATA GATATTGCTC ATTAAGGTAA ATATTTTTGT 780 

TGATGAATGA TCTGGAATGG TCTGGACTTG TTGTGTGAAC AGGAAATTGC TCTGTAGGCT 340 • 

TTGACTTGTG AGGTAAAGAG TGAGGCTGGT AAGATTAATT AAAGTAAATA CTGTGACAAT 900 . 

• AGGATGTCAA AACCAAAAAC GTGTTTCTGA AACTCAAGGA. ATTAATGACA. CATAGGGAAG 960 

TTTTTGCCAT ATTAAGCATA GAGTAGGAGA GGCAAGTCAA GAATAAAAAA AAAAAAAA 1018 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 661 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



TTTAAGAAAT TAGTGAATCC CCGGNTGCAG GGAATTCGGC ACGAGGAGGA GGCCGTCAGC 50 
TGGCAGGAGC ^ GCAGGATGGC AGCTGYTCCC CCGGGTTGCA CCCCCCCAGY TCTGCTGGAC ' 120 

ATAAGYTGGT TAACAGAGAG CCTGGGAGCT GGGCAGCCTG TAC CTGTGGA GTGCCGGCAC 130 

CGCCTGGAGG TGGCTGGGCC. AAGQAAGGGG CCTCTGAGCC CAGCATGGAT GCCTGCCTAT ' 240 

GCCTGCCAGC GCCCTACGCC CCTCACACAC CACAACACTG GCCTMTCCGA GCTGCTGGAG 300 

CATGGAGTGT GTGAGGAGGT GGAGAGAGTT CGGCGCTCAG AGAGGTACCA GACCATGAAG 360 

GTGCGCAGGG CAGGGCTCGG ACCTACCCCA GGAATGTCCT GCCCTGGGAA TGACAACACA 42 Q 
GTCCACACCA TGCACGGGGA GGCAAACAGG GGCAGCTGAC CCAGCCCAGG GGTCAGANGA . 430 

GGTCTTGCCG AGGAAGTGGC AGCTAAGCTG ATACCTGATA TGCACWAGKC AGCCARGYGG 540 

AGACAGGCAA GGAAGAAGCT TGTTTTGAGG ACAGAATTTT CTAGATCACT CAGCACCATC 600 

TGGCTTTTGG GGCTTTTTGT TTTATTTTGT TTTTGAGACG GGGTCTCGCT CTGTCGCCCA 660 

N '661 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 553 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGCACAGGGC TATTTGCCCC TCTCTCCACA TGACAGAA.CT GCTCTAAGTT TCTTTGCTGC 
TCTTCTCAGC TGTCAGACGG ' CTTGCTGCTT GTTTTCCACA CCACCATGTC TATTCTTTGC 
TGTCCTTWAC TCTGCCTGTT TTTTTCCTTT TGTATTTCTT .CTGGCTCTTG TCCCTTTTCC 
CACGTGTCWC AC-CTTTCCTT TATTGCCACT TTCAGTCAGA GCAGTCCTGT GCTTCTGGTG 
CCGGCATACA ATACTTACTT GAGTTTCTTG GCTTTTCTTG ACTGTGCATC TCTTACTTCn 
ACATAGGAAT AGCCTGTCAT AGAATTTCTC CAGTTCCAGG GCTCAAGAGG GAGAGTGCCA 
GAAAATTGAG ACTGTTTTCC CTGTCTTGGA TTGAATTCAT AAAGCAAAAC CAGTGTTTGT 
GTGAGGGTTT GCTGTGTCAT GCCTATAGGT TGTTTGGGTG CAAACCTATA GAATCCAGCC 
TGCGAAAAGA AAGRAACCAG AGAATANCAG CATCAGAACA ATGCTTGACA TCATTTCTCA 
ATCAAGCAGT CCA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 869 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GGCACGAGCT GCCAACACTG AGGTCTTCGT GGCTTCTCAC ATCTAGATGT ATCCCTCTCA 
AATCTATCCT CTATCCAGGC ACCAGATTGA GGTATCTAAA ATGTCAACTT TC'CAGTTACT 
CCTTCTTATA CTAGCOCAAT CAACTTACAA GATAAAGTCC AAGCCCCTTC ATATGACAAA 
CCACACCCTG CTTAACTCTC CAGGTTTGAA TCCTTCATCT CCTACTTTAA ACTTTAAAAC 
CCAGCAGCAC GAAAGTGTCT CCTATGCATG TTGCCATATG CGTTCTCTCC ATCATGCATT 
TGCCTGAGCA AGATGTCTTG AGTTAACATC . TTATTCTTTA AGACTCATTG TGGTGGTAGA 
CAGCCTTTAA TAACGGATCC TTGGCCAGGC ACAGTGACTC ACACCTGTAA TCCCAGAACT 
TTGAAAGGCC AAAGAAGGAA GAAAGCTTGA GGCCAGTAGT TTGAGACCAG CCTGGGAAAC 
AGAGAGATAT CCCATCTGTA CCAAAAATTT AAAAAAATAT TAGCAGGGAG TAGTGGCATG 
CACAAGTGGT CCCAGCTCCA TGGGAGASTG AGGTAGGAAC ATCACTTGAG CCCAGGAAGT 
CAAGGCTGCA GTGAACCATG ATCAGAACAT TGCANTCCAG CTTGGGTAAC AGAGTGAGAC 
CTTAGGTCAG AAAAATGAA.T AAATAAGCAT AAAATTTTAA AAACTTAGCC AGGCATGGTG 
GCACACATCT GTGGTCCCTG CTACTTAGGA GGCTGAGGTG AGAGGATCCT TGAGCCCAGG 
AGGTCAACAC TACAGTGAGG TATGATTGTG CCACTAAACT CCAACCTGGG TGAAAAAGCA 
AAACCCTGCC AAAAAAAAAA AAAAAAACT 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 959 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGCGAGCCGA GATCGTGCCA TTGCACTC CA GCCTGGGCAA CAAGAGTGAA ACTCTGTCTC 
AAAAAAAAAA • AATTAT AAT A CTATATGCCA TAAAATGACA TTTCATATTT AAAGAGTTTT 
• TTAAAACTCT TGTATTCACA TGCCATAATT TGAAACCCTA TTTCACTGAA TGAGAATGGT 
ATCTGTTGTC CTCATTTTTT CATTTTTATC CTTAACAATT TCCACCACAG CCA.GTGCATA. 
TAATGGCAAT GACACCCAGG GATGGAATGA TAAGTTCCAt CRCMGCTCAG TCAAiGACGCA 
GACTTGATGT GGCCCCAACA ACAGTCAATA ATGGAGTCTC CAAAATAAAG CTCTATAGGA 
AAGGTAAATA CCCGCTGCAC AAGAAACCAC AGCATCTAGG TTCTAACCCC ATCTCTATGA 
AGAGCTTGCT GGGAGAGTTT TGACATTWAA CAATCTGTCT GATKGCCAAT TTTYTTCTTC 
TATAAAATGA TAATGTTKGA YTCAAAGATC CAAAGTCAAT • TCATGGTCTA AAACTTAATG 
ATTTTTTTAG GTTTTGKGAC ATTTCACTGT ACACTGTAGT AATTTATATC TTA.TTTTCCC 
ACTAATTTAG AAAAATATYT AAATGATCCT TAATTGGCAA TGGGTCCTAA GAATTTTGTT 
TTAAATCCCT G TTACCCAAA - AGAGCCCTTT TTTGTATCTC GCAGTAGTTA CAAGGATCTT 
TCTAAATCTT AAAAAAAAAA AAAAAAGAAA GAAAGAAAAG AAAAGAAAAA. AAGTCAGCCG 
GGCGTGGTGG CTCATGCCTG TAATCCCAGC ACTTTGGGAC CAAGGTGGAC AGATCACGAG 
GTCAGGAGAT GGAGACCATC CCGGCCAACA TGGAGAAACC. CTGTCTCTAC TAAAAAAAAA 
AAAAACTCGA GGGGGGCCCG GTACCCAATN CGCCGGCTAG TGGTCGTAAA ACAATCAAA 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1446 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CGGGGCAGGG CTGTGTGGCA CCC-COAGGGA GOGGGCCCAC CTGAGTCACT TTATTGGGTT 60 

CAGTCAACAC TTTCTTGCTC CCTGTTTTCT CTTCTGTGGG ATGATCTCAG ATGCAGGGGC 120 

TGGTTTTGGG GTTTTCCTGC TTGTGCCAAG GCKTTGGACAC TQCVZOGC-GG CTGGAAAGCC 130 

15 CCTCCCTTCC TGTCCTTCTG TGGCCTGCAT CCCCTCATGG GTGCTGCCAT CCTTCCTGGA 240 

GAGAGGGAGG TGAAAGCTGG TGTGAGCCCA GTGGGTTCCC GCCCACTCAC CCAGGAGCTG 300 

GCTGGGCCAG GACCGGGAGA GGGAGCACTG CTGCCCTCCT GGCCCTGCTC CTTCCGCAGT 360 

20 - ' 

TAGGGGTGGA CCGAGCCTCG GTTTCCCCAC TGTTCTGGAG' GGAAGGGGAA GGAGGGGGTC 420 

TTCAGGCTGG AGCCAGGCTG GGGGTGCTGG GTGGAGAGAT GAGATTTAGG GGGTGCCTCA 430 

25 TGGGGTGGGC AGGCCTGGGG TGAAATRAGA AAGGCCCAGA ACGTGCAGGT CTGCGGI^GGG 540 

GAAGTGTCCT GAGTGAAGGA GGGGACCGGC A'TGCTGGGGG ATGCTGGGAG TGAGTGAGTG 600 

AGATGGCTGA GTGAGGGTTA TGGGGAGCCT GAGGTTTTAT GGGCCTGTGT ATCCCCTTCT ' 660 

30 • 

CCCGGCCCCA GCCTGCCTCC CTCCTGCCCG CCTGGCCCAC AGGTCTCCCT CTGGTCCCTG 720 

TCCCTCTGGT GGTTGGGGAT GGAGCGGCAG CAAGGGGTGT AATGGGGCTG GGTTCTGTCT 730 

35 TCTACAGGCC ACCCGGAGGT CCTCAGTGGT TGCCTGGGGA GCCGGACGGG GCTCCTGAGG 340 

GGTACAGGTT GGOnCGGGCCC TCCCTGAGGG TCTGGGGTCA GGCTTTGGCT CTGCTGCCTC 900 

TC^GTCACCA AGTCACCTGC CTCTGAAAAT CC^GTGCCTT CTTTGGATGT CGTTGTGAGT 960 

40 

CACTCTGGGC CTGGCTGTCG TCCCTCCTCA GCTTCTTGTT CCTGGGACAA GGGTGAAGCC 1020 

AGGATGGGCC CAGGCCTGGG ATCCCCCACC CCAGGACCCC OVGGCCCCCT CCCCTGCTGC IQS0 

45 'TTTGCGGGGG GCAGGGCAGA AATGGACTCC TTTTGGGTCC CCGAGGTGGG GTCCCCTCCC 1140 

AGCCCTGCAT CCTCCGTGCC STAGACCTGC TCCCCAGAGG AGGGGCCTTG ACGCAC^GGA 1200 

CGTGTGGTGG CGCCTGGCAC TCAGGGACCC CCAGCTGCCC CAGCCCTGGT CTCTGGCGCA 1260 

50 

TCrCTTCCCT CTTGTCCCGA AGATCTGCGC CTCTAGTGCC TTTTGAGGGG TTGCCATCAT 1320 

• ' CCCTCCCTGA TATTGTATTG AAAAT ATT AT GC\C\CTCTT CATGCTTCTA CTAATCAATA 1330 

-55 AACGCTTTAT TTAAAGGGAA AAAAAAAAAA AAAAAACTCG AC<^GGGGCC CGTACCCAAT 1440 
TCGCCA 1446 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1471 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CAAAAAATAA TAATGATAAT TTAAAATAAA TAAGTAACTA ATAAAAAGAT TTTATATCCC 60 

AGTCTTATGA TGTTGGTTGG CAAGGCTAGA TAAAAAGATG TTA.GAATGAA. AGAAOTATt' V>Q 

15 . 

TTTAGTGATA' tgtaaatgaa ggattctaca atagtcatat atttttatat GAATGAATGT 130 

TGGGTTGGGC TGGAGAGGTA TGTGTGTGTA AATATAAAGG TCTCACATTC AGAGTATAGC 240 

20 TCTGAAATAA TGGAACTCAT GTCTACAATT CAACATGCAT CTGTATAGTT ACATCTCATG 300 

TAAATATACA- CAGACATA.TT TTGC^GCCAG TAATTGACAG TTAATGTCCA AAACAGGTGA ' 360 

- - TTGATAGGT A ACAGAAATTA GATAACCACC AATTTTGCCC AAGAGAAAGA' CTAGAAGGAC 420 

25 

. TAAAAGCAGT ■ TGAATGTATG GTACTGACAT TGTCATAAGC AGTCTGATAA CCAGTTTATT - 480 

GAAACGTGTG CATTAACAGA GAATTTAATT TTAAACCCAT AATTTCTCCT ATCCATTAAA 540 - 

30 ATATTATAAT TGTTAGTAGT ATGAAACCAA CAGGAAATGT TTTTTAATCA TTTAGTGAGG 600 

TGATXCATTT GTTTCATGGG CAAACACTAT CCAGGAAAAG CCTTGCTTGC CTGTTTCCCA 660 

AAGAGCTCTA AGAAAT.AGAA TCAAGTGTAA AATGGTTGAG AC C ATTCAGG ATTTCTTGTC 77 Q. 

35 

ACTCTTCTCA ACCCCGATCT TCCTGTTATT ACTGATGTTT GAAACCCTGT • CATTAGCCCC 730 

GGCCTGGTTA AAGCCCCTCA GAGTCACCTC TCATTCATAG CAATAGAATT CAACCCCAAG 340 ' 

40 TGGTTGATGG TGTCCCCAGC ACAGCCGAGA GACCTGATCT CTGGATTCAG TGCTTTTAGC 900 

TCTTCGAGTT TACCCTAAGA TACCTTCGGG CAATATTTTT AACCAACCCA AAAGCTCTTC 960 
AGGTCATTTC TGAAGAGGAC AAGGTGAA.TC TTGGCTTGGA ACACCATTTT TGGGCTCTTG . 1020 

45 , - 

CTACTGAATG AATCAGAAAG GAATTTTTTC TGAAGAGCAT TAGAAAGTAA AGGA3ATGTT 1080 

AAAATAAGTT CTTGAAGTAT GTTTTATATT TATCTAAAAC ACTGATTTTA AAAGTTTACA H4Q 

50 - TTCAAATGTG TATTCAAAAG AAGTACTGAT TTGTAATTAT TATAGTTTGT GTGTATCATC 12Q0 

CCCTTTTAAC CGTGCCTAAC AACIGTACTT AAATTTTGTT TTCCTAGTGT AACAAATGTT 1250 

TCCCATAAGA TTTTCTAGAG CCAAATAATG GGAGTGAAAA ATTCCTTAAG TGTTATATAA 1320 

55 

GAAAATATAT TAGAAAATCA GCTTTGGATT ATACGATTTC TAAAATATAC TAATACAGAA 1380 

TCCTCAGTAA TATGTTTTGA ATTGGA.TTTT TTCTCAGAAC TGTTACATAA TAAATAATAC 1440 

ATCAACCAGA AAAAAAAAAA AAAAAAATTN C 1471 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1402 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

AGGGACGTCT TGCCTGAGGA GATGCCCATT TCTGTCCTGG RTTACCCTCA CTGCGTGGTG 

CATGAGCTGC CAGAGCTGAC GGCGGAGAGT TTGGAAGCAG GTGACAGTAA CCAATTTTGC 

TGGAGGAACC TCTTTTCTTG TATCAATCTG CTTCGGATCT TGAAOA.GCT GACAAAGTGG 

AAGCATTCAA GGACAATGAT GCTGGTGGTG TTCAAGTCAC CCCCCATCTT GAAGCGGGCC 

CTAAAGGTGA AACAAGCCAT GATGCAGCTC TATGTGCTGA AGCTGCTCAA GGTACAGACC 

AAATACTTGG GGCGGCAGTG GCGAAAGAGC AACA.TGAAGA CCATGTCTGC CATCTACCAG 

AAGGTGCGGC ATCGGCTGAA CGACGACTGG GCATACGGCA ATGATCTTGA TGCCCGGCCT 

TGGGACTTCC AGGCAGAGGA GTGTGCCCTT CGTGCCAACA TTGAACGCTT ' CAACGCCCGG 

CGCTATGACC GGGCCCACAG CAACCCTGAC TTCCTGCCAG TGGACAA.CTG CCTGCAGAGT 

GTCCTGOGCC AACGGGTGGA CCTCCCTGAG GA.CTTTCAGA TGAACTATGA CCTCTGGTTA 

GAAAGGGAGG TCTTCTCCAA GCCCATTTCC TGGGAAGAGC TGCTGCAGTG AGGCTGTTGG 

TTAGGGGACT GAAATGGAGA GAAAAGATGA TCTGAAGGTA CCTGTGGGAC TGTCCTAGTT 

CATTGCTGCA GTGCTCCCAT CCCCCACCAG GTGGCAGCAC AGCCCCACTG TGTCTTCCGC 

AGTCTGTCCT GGGCTTGGGT GAGCCCAGCT TGACCTCCCC TTGGTTCCCA GGGTCCTGCT 

CCGAAGCAGT CATCTCTGCC TGAGATCCAT TCTTCCTTTA WTTCCCCCAM CCTCCTCTCT 

TGGATATGGT TGGTTTTGGC TCATTTCACA ATCAGCCCAA GGYTGGGAAA- GCTGGAATGG 

GATGGGAACC CCTCCGCCGT GCATCTRAAT TTCAGGGGTC ATGCTGATGC CTCTCGAGAC 

ATACAAATCC TTGCCTTTGT OGCTTGCAA AGGAGGAGAG TTTAGGATTA GGGCCAGGGC 

CAGAAA.GTCG GTATCTTGGT TGTGCTCTGG GGTGGGGGTG GGGTGTTTCT GATGTTATTC 

CAGCCTCCTG CTACATTATA TCCAGAAGTA ATTGC GGAGG CTCCTTCAGC TGCCTCAGCA. 

CTTTGATTTT GGACAGGGA.C AAGGTAGGAA GAGAAGCTTC CCTTAACCAG AGGGGCCATT 

TTTCCTTTTG GCTTTCGAGG GCCTGTAAAT ATCTATATAT AATTCTGTGT GTATTCTGTG 

TCATGTTGGG GTTTTTAATG TGATTGTGTA TTCTGTTTAC ATTAAAAAGA AGCAAAAATA 

ATAAAAAAAA AAAAAAAAAA CT 1402 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1047 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GGCACAGGGG ACTACAGGCA CCCACGACGA TACCCAGCTA ATTTTTGTAT TTTTTTGTAG 60 
AGATGGGGTT TCACGATOTC GCCCAGGCTG GTCTTGAACT CCTGGGCTTG AGCGATCTTC 120 
CCATCTTTCC ATCTTGGCCT CCTAAAGTGC TGGGACTGCA GGCATGAGCC ACCATGCCCA 180 
GCCAAGATTC TTATTGATTA CCATGTTGCT TCAAGAAGCC AAGCCAGTTT CCAATATTCC 240 
CCATTTGCTG GAGTCTTGGT ACTTTGGGTA GAAGCAACTG GTAAATTGTT AATTGGAACA 300 
NTTGGTGGTG TAGATAACCA CGTATGGCCA AACCTAGAGC ATCTAGGCTC ACAATTACTA 360 
TCCTGACTTG ATAACAAGTG TTCTGATATT AACCTGAAAA TGGGAATAAT GCCAAATCTG 420 
TGTAACTTAA CATCTATATA CACAGTGGGG AGAACTGAAG TTATTAAACC TGGAATCTCT 480 
GTGATCAAGG CTAACAGTAG TTATCTAAGA AGCAAAGGAC CTACAATTCT TAGACTTGGA 540 
GTCATATTCT TTAAGGACGT GTTCTGAAAC TATATCAAGC ATCTGGTTTC CACGTATTTC 600 
TCCCTCAGAA ATTATGAAGT ACAAGTAAAA ATGAAGGTAC AGGGTAAGAC ACATGCTGCT 660 
TTCTTGCTCT TGAGTGGAGA CAGTTTTCCA GCCATCTTAA CCCCTTWACA CAAAACAATT 720 
TGTGTTTTAT AGCAAATAAG TGACTCAACA TAATTTCAAT ATGATGTTTA TOCACCAGTA 780 
CTTTCCTTTC AGCTTCTAGT CCCATAARTG GTTTGTGAAG TCATCGGTTA CATTAGCCAA 840 
GATAGGCCTA GACTTGAAGT CTAGAATGTT TTTCCCACTA TATGCCAAAG TAGAATGTGG 900 
GTATCTCAGG GTCATTTTTG TTGTTCAATT TCCCACCTGT ACAGTTGTTA TGATTCACTT 960 
TCCTTATGTG TCTAATAAAT CTTGTTCCAT GAAATGATCA AAAAAAAAAA AAAAAAAACT 1020 
CGAGGGGGGG CCCGGTACCC AAATCGC 1047 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 990 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TTGGAAAGGG TCTAGCTCTT TCTCATTCAC CAACTATATT AGAAGCACTT GAGGGAAATT
TACCACTCCA AATCCAAAGC AATGAACAGT CTTTTCTGGA TGATTTTATT GCCTGTGTCC 
CAGGATCAAG TGGTGGAAGG CTTGCAAGGT GGCTTCAGCC AGATTCATAT GCGGATCCTC 
AGAAAACATC TTTGATCCTG GAATAAGGAT GATATTCGTT GTGGTTGGCC TACCACCATA
ACTGTTCAAA CAAAAGACCA GTATGGGGAT GTGGTACATG TTCCCAATAT GAAGGTAATT
ATAACTGGAT TAAATTAGCA GACATCTATA TACTGGCTGC AATGACTGAT AAAATTTTAG 
AAATGCCAAG TGCTGAGRGT CCATTTGTTC TACCCTCTTT ATATAAAGGG TGATGCTGAA 
AGTTTGTTTA AATGACTTGT TTATATTAAT TAGTCCCCAA GTGTCGAAGT TACACCTGTT 
TTTTTTGTGA GTTTGTTCTT TACATTTTGC TACCTGTTAC GGGGACTGAA AGGAGGGATA 
AGAAAGTATC CATCTAAAGA GTGCTAGAGA CATACAGTGA AGGCCCTCAA TATGTATTGA
TTGAATAAAT GCATGAAAGA ATACATTTTT AAATTTTGTG TATAGTTTTG AAAGACTCAA
GTACGTTCTG TGTTTGGTAT TACTGAAACC ACATTTTAAA AATAACACTC ATTAAGTTAG 
AAATATATGA GTTTAGATTG TAAAAGAATG AGGAATTGAA ATAGTTGTAT ACCATATTGA
TGAATATAGA GTTTTTAGGA TACCTGTTAC CTGAAATATT AATAATAATG TTTNCAGAGC 
ATATTATACA TAATTATTTG TGATTTAATC TGTTAATATG AATATCTCAT TTAAAACTTT 
TATTTCTGAA AAAATTATAT TGAATAAAAT TTTATATAGG CAGTCCCCAG CCCTTTCCTC
CTTCAAAGTT GTCTTATAGA GTGATTGGTT

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1208 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
TAATCGCTAC TATAGGGAAA GCTGGTCGCT GCAGGTACCG GTCCGGAATT CCGGGTCGAC 
CCACGCGTCC GAGGGAAATG GCGCCTCCGG CCCCCGGCCC GGCCTCCGGC GGCTCCGGGG 
AGGTAGACGA GCTGTTCGAC GTAAAGAACG CCTTCTACAT CGGCAGCTAC CAGCAGTGO, 
TAAACGAGGC GCASGGGTGA AGCTRTCAAG CCCAGAGAGA GACGTGGAGA GGGACGTCTT 
CCTGTATAGA GCGTAOCTGG CGCAGAGGAA GTTCGGTGTG GTCCTGGATG AGATCAAGCC
CTCCTCGGCC CCTGAGCTCC AGGCCGTGCG CATGTTTGCT GACTACCTCG CCCACGAGAG 360
TGGGAGGGAC AGCATCGTGG CCGAGCTGGA CCGAGAGATG AGCAGGAGCK TGGACGTGAC 420
CAACACCACC TTCCTGCTCA TGGCCGCCTC CATCTATCTC CACGACCAGA ACCCGGATGC 480
CGCCCTC-CGT GCGCTGCACC AGGGGGACAG CCTGGAGTGC ACAGCGATGA CAGTGCAGAT 540
CCTGCTGAAG CTGGACCGCC TGGACCTCGC CCGGAAGGAG CTGAAGAGAA TGCAGGACCT 600
GGACGAGGAT GCCACCCTCA CCCAGCTCGC CACTGCCTGG GTCAGCGTGG CCACGGGTGG 660
TGAGAAGGTG CAGGATGGCT ACTACATCTT CCAGGAGATG GGTGACAAGT GCTCGCCCAC 720
CCTGCTGCTG CTCAATGGGC AGGCGGCCTG CCACATGGCC CAGGGCCGCT GGGAGGCCGC 780
TGAGGGCCTG CTGCAGGAGG CGCTAGACAA GGATAGTGGC TACCCRGAGA CGCTGGTCAA 840
CCTCATCGTC CTGTCCCAGC ACCTKGGCAA GCCCCCTGAG GTGACAAACC GATACCTGTC 900
CCAGCTGAAG GATGGGCACA GGTCCCATCC CTTGATCAAG GAGTACCAGG CCAAGGAGAA 960
CGACTTTGAC AGGGTGGTGC TACAGTACGC TCCCAGCGCT GAGGCTGGCC CAGAGCTGTC 1020
AGGACCATGA AGCCAGGACA GAGGCCAGGA GCCAGCCCTG CAGCCCTCCC CACGCGGCAT 1080
CCACCTGCAT CCCTCTGGGG CAGGAGCCCA CCCCCAGCAC CCCCATCTGT TAATAAATAT 1140
CTCAACTCCA RGGTGTTCCA CCTGAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1200
AAAAAAAA 1208
(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1922 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

GTGCTGCGCT ACTGAGCAGC GCCATGGAGG ACTCTGAAGC ACTGGGCTTC GAACACATGG 60
GCCTCGATCC CCGGCTCCTT CAGGCTGTCA CCGATCTGGG CTGGTCGCGA CCTACGCTGA 120
TCCAGGAGAA GGCCATCCCA CTGGCCCTAG AAGGGAAGGA CCTCCTGGCT CGGGCCCGCA 180
CGGGCTCCGG GAAGACGGCC GCTTATGCTA TTCCGATGCT GCAGCTGTTG CTCCATAGGA 240
AGGOGACAGG TCCGGTGGTA GAACAGGCAG TGAGAGGGCT TGTTCTTGTT CCTACCAAGG 300
AGCTGGCACG GCAAGCACAG TCCATGATTC AGCAGCTGGC TACCTACTGT GCTCGGGATG 360
TCCGAGTGGC CAATGTCTCA GCTGCTGAAG ACTCAGTCTC TCAGAGAGCT GTGCTGATGG 420
AGAAGCCAGA TGTCGTAGTA GGGACCCCAT CTCGCATATT AAGCCACTTG CAGCAAGACA 480
GCCTGAAACT TCGTGACTCC CTGGAGCTTT TGGTGGTGGA CGAAGCTGAC CTTCTTTTTT 540
CCTTTGGCTT TGAAGAAGAG CTCAAGAGTC TCCTCTGTCA CTTGCCCCGG ATTTACCAGG 600
CTTTTCTCAT GTCAGCTACT TTTAACGAGG ACGTACAAGC ACTCAAGGAG CTGATATTAC 660
ATAACCCGGT TACCCTTAAG TTACAGGAGT CCCAGCTGCC TGGGCCAGAC CAGTTACAGC 720
AGTTTCAGGT GGTCTGTGAG ACTGAGGAAG ACAAATTCCT CCTGCTGTAT GCCCTGCTCA 780
AGCTGTCATT GATTCGGGGC AAGTCTCTGC TCTTTGTCAA CACTCTAGAA CGGAGTTACC 840
GGCTACGCCT GTTCTTGGAA CAGTTCAGCA TCCCCACCTG TGTGCTCAAT GGAGAGCTTC 900
CACTGCGCTC CAGGTGCCAC ATCATCTCAC AGTTCAACCA AGGCTTCTAC GACTGTGTCA 960
TAGCAACTGA TGCTGAAGTC CTGGGGGCCC CAGTCAAGGG CAAGCGTCGG GGCCGAGGGC 1020
CMAAAGGGGA CAAGGCCTCT GATCCGGAAG CAGGTGTGGC CCGGGGCATA GACTTCCACC 1080
ATGTGTCTGC TGTGCTCAAC TTTGATCTTC CCCCAACCCC TGAGGCCTAC ATCCATCGAG 1140
CTGGCAGGAC AGCACGCGCT AACAACCCAG GCATAGTCTT AACCTTTGTG CTTCCCACGG 1200
AGCAGTTCCA CTTAGGCAAG ATTGAGGAGC TTCTCAGTGG AGAGAACAGG GGGCCCATTC 1260
TGCTCCCCTA CCAGTTCCGG ATGGAGGAGA TCGAGGGCTT CCGCTATCGC TGCAGGGATG 1320
CCATGGGCTC AGTGACTAAG CAGGCCATTC GGGAGGCAAG ATTGAAGGAG ATCAAGGAAG 1380
AGGTTCTGCA TTCTGAGAAG CTTAAGACAT ACTTTGAAGA CAACCCTAGG GACCTCCAGC 1440
TGCTGCGGCA TGACCTACCT TTGCACCCCG GAGTGGTGAA GCCCCACCTG GGCCATGTTC 1500
CTGACTACCT GGTTCCTCCT GCTCTCCGTG GGCTGGTRCG CCCTCACAAG AAGCGGAAGA 1560
AGCTGTCTTC CTCTTGTAGG AAGGCCAAGA GAGCAAAGTC CCAGAACCCA CTGCGCAGCT 1620
TGAAGCACAA AGGAAAGAAA TTCAGACCCA CAGCCAAGCC CTCCTGAGGT TGTTGGGCCT 1680
CTCTGGAGCT GAGCACATTG TGGAGCACAG GCTTACACCC TTCGTGGACA GGCGAGGCTC 1740
TGGTGCTTAC TGGACAGCCT GAACAGACAG TTCTGGGGCC GGCAGTGCTG GGCCCTTTAG 1800
CTCCTTGGCA CTTCCAAGCT GGCATCTTGC CCCTTGACAA CAGAATAAAA ATTTTAGCTG 1860
CCCCAAAAAA AAAAAAAAAA AAAAAAACTO GAGGGGGGGC CCGTACCCAA TTCGCCCTAT 1920
AA 1922

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1951 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
TCGTCCCCAG AGCGGGCTGA GCCCCAGGCG SAGGGTGGCG GGGGAGCCTG GGGGAGCCGC 
CGCCACCTCC ACGGGCCTCT CTGAGCTCGG ACACCAGCGC CCTGTCCTAT GACTCTGTCA 

AGTACACGCT GGTGGTAGAT GAGCATGCAC AGCTGGAGCT GGTGAGCCTG CGCCGTGCTT
GGGAGACTAC AGTGACGAGA GTGACTCTGC CACCGTCTAT GACAACTGTG CCTCCGTCTC
CTCGCCCTAT GAGTCGGCCA TCGGAGAGGA ATATGAGGAG GCCCCGCGGC CCCAGCCCCC
TGCCTGCCTC TCCGAGGAAC TCCACGCCTG ATGAACCCGA CGTCCATTTC TCCAAGAAAT 
TCCTGAACGT YTTCATGAGT GGCCGCTCCC GCTCCTCCAG TGCTGAGTCC TTCGGGCTGT 
TCTCCTGCAT CATCAACGGG GAGGAGCAGG AGCAGACCCA CCGGGCCATA TTCAGGTTTG 
TGCCTCGACA GGAAGACGAA CTTGAGCTGG AAGTGGATGA CCCTCTGCTA GTGGAGCTCC 

AGGCTGAAGA CTACTGGTAC GACGCCTACA ACATGCGCAC TGGTGCCGGG GGTGTCTTTC
CTGCCTATTA CGCCATCGAG GTCACCAAGG AfiCCCGAGCA CATGGCAGCC CTGGCCAAAA
ACAGTGACTG GGTGGACCAG TTCCGGGTGA AGTTCGTGGG CTCAGTCCAG GTTCCCTATC
ACAAGGGCAA TGACGTCGTC TGTGCTGCTA TGCAAAAGAT TGCCACCACC CGCCGGCTCA
CCGTGCACTT TAACCCGCCC TCCAGCTGTG TCCTGGAGAT CAGCGTGCGG GGTGTGAAGA
TAGGCGTCAA GGCCGATGAC TCCCAGGAGG CCAAGGGGAA TAAATGTAGC CACTTTTTCC
AGTTAAAAAA CATCTCTTTC TGCGGATATC ATCCAAAGAA CAACAAGTAC TTTGGGTTCA
TCACCAAGCA CCCGGGCGAC CACCGGTTTG CCTGCCACGT CTTTGTGTCT GAAGACTCCA
GCAAAGCGCT GGCAGAGTCC GTGGGGAGAG CATTCCAGCA GTTCTACAAG CAGTTTGXGG
AGTACACGTG CCCCACAGAA GATATCTACC TGGAGTAGCT GTGCAGCCCC GCCCTCTGCG 
TCCCCCAGGC CTCAGGCCAG TGCCAGGACA GCTGGCTGCT GACAGGATGT GGCACTGCTT 
GAGGAGGGGC ACCTGCCACC GCCAGAGGAC AAGGAAGTGG GGCGCTGGCC CAGGGTAGGG 
GAGGGTGGGG CAATGGGGAG AGGCAAATGC AGTTTATTGT AATATATGGG ATTAGATTCA 
TCTATGGAGG GCAGAGTGGG CTGCCTGGGG ATTGGGAGGG ACAGGGCTTC GGGAGCAGGT 
CTCTGGCAGA GAAGGATGTC CGTTCCAGGA GCACACGGCC CTGCCCCATC CTGGGCCTTA 
CCTCCCCTGC CAGGGCTCGG GCGCTGTGGC TCCTGCCTTG ATGAAGCCCG TGTCCTGCCT 
TGATGAAGCC TGTGCCACCT GCAAGTGCCC GCCCTGCCCC TGCCCCAACC CCCACCGAAG 
AGCCCTGAGC TCAGGCTGAG CCCAGCCACC TCCCAAGGAC TTTCC a JGTGA GG?AATGOCA 
ACACGTGGAG GTGAAGTCCC TGTTCTCAGC TCCGTCATCT GCGGGGCTTC TGGGTGGCTC
CTGCCACTGA CCTCACCGGC ATGCTGGCCT GTGGCAGGCC TAGGACCTCA C-COCC-GGAGG 1740
AGGAGCTGCC GCAAGGCCCT GTCCCAGCAG AAGAGGGAGG CTTCCTGACT GACACAGGCC 1800
AGCCCCATCT TGGTCCTGTC ACCCTGGCCC CAACTATTAA AGTGCCATTT CCTGTCAAAA 1860
AAAAAAAAAA AAAATCGGGG GGGGCCCGGA ANCCAATTTC CCCC^AAAAG C-GGC-GTTATA 1920
AAAATTCCCN GGCNGTGTTT TTAAAAATTC G 1951

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3989 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

GGCACAGGCC GCAGGGNACC TATGGGCGCA TATAGGTTGT AATGAAACTG TAGTCTCAGT 60
TGGAAGCCTA GACATGAAAT GGGTCAGTGA GCAAGGCTCT ATTCCTAGTC TCCAGCCATG 120
CCTGTGGAAC CTGARCCCRC TCTC^GCACA TTGGACCCAG GCAGATGYAA AAAATTCACA 180
GAACTATGAT TTGGACTCAA GGGTTTGTAG ATTTCCTCCT TCATTCTAAT TTCAGTGTCT 240
AAAATTCTTG CATCCRTGAA CGAGCTGGGC ATTTGATGAG ACAGGGCVGA ATACTGCAGT 300
TTTCCTCCTA GAAATCATCT GGGGCATTTT CTTTGAACTG ATGGGAACAA TAAGGCATAA 360
CTGTTTGCAC AAACTTGGGA TAARTGATTT TGGGATAACG ATCTACCAGA ATGGGGATAT 420
TTCACCCTTG GTTCTGAGAT GCAAACCAAA GAATATCATG ACCAGCTTTC AGGCCTCCTG 480
AAGTATATCT CTCACATTGT CCTGTTCTCA TGCTGAGGAG CCTGAGATCC CTGTGTGGGG 540
ATTAGACAGT GGACTGTTAT GGGTGTAGGT GAATTGGCTT ATTTTGTCTG TCCCTGTCTG 600
AATGTATTGC AGGAAYTAAA AAGGACCAAG AAGAGGAAGA AGACCAAGGC CCACCATGCC 660
CCAGGCTCAG CAGGGAGCTG CTGGAGGTAG TAGAGCCTGA AGTCTTGCAG GACTCACTGG 720
ATACATGTTA TTCAACTCCT TCCAGTTGTC TTGAACAGCC TGACTCCTGC C\GCCCTATG 780
GAAGTTCCTT TTATGCATTG GAGGAAAAAC ATGTTGGCTT TTCTCTTGAC GTGGGAGAAA 840
TTGAAAAGAA GGGGAAGGGG AAGAAAAGAA GGGGAAGAAG ATCAAAGAAG GAAAGAAGAA 900
GGGGAAGAAA AGAAGGGGAA GAAGATCAAA ACCCACCATG CCCCAGGCTC AGCAGGGAGC 960
TGCTGGATGA GAAAGRGCCT GAAGTCTTGC AGGACTCACT GGATAGATGT TATTCAACTC 1020

CTTCAGTTGT GTTGAACTGT GTGACTCATG CCAGCCCTAC AGAAGTGCCT TTTATGTATT 1080
GGAGCAACAG CATGTTGGCT TGGCTGTTGA CATGGATGAA ATTGAAAAGT ACCAAGAAGT 1140
GGAAGAAGAC CAAGACCCAT CATGCCCCAG GCTCAGCAGG GAGCTGCTGG ATGAGAAAGA 1200
GCCTGAAGTC TTGCAGGACT CACTGGATAG ATGTTATTCG ACTCCTTCAG GTTATCTTGA 1260
ACTGCCTGAC TTAGGCCAGC CCTACAGOAG TGCXGTTTAC TCATTGGAGG AMCAJCEACCT 1320
TGGCTTKKCT CTTGACGTGG ASAAATTGAA AAGAAGGGGA AGGGGAARAA AAGAAGGGGA 1380
AGAAGATCAA AGAAGGAAAG AAGAAGGGGA AGAAAAGAAG GGGAAGAAGA TCAAAACCCA 1440
CCATGCCCCA GGCTCAGCAG GGAGCTGCTG GATGAGAAAG GGCCTGAAGT CTTGCAGGAC 1500
TCACTGGATA GATGTTATTC AACTCCTTCA GGTTGTCTTG AACTGACTGA CTCATGCCAG 1560
CCCTACAGAA GTGCCTTTTA TRTATTGGAG CAACAGYGTG TTGGCTTGGC TGTTGACATG 1620
GATGAAATTG AAAAGTACCA AGAAGTGGAA GAAGACCAAG ACCCATCATG CCCCAGGCTO 1680
AGCAGGGAGC TGCTGGATGA GAAAGAGCCT GAAGTCTTGC AGGACTCACT GGATAGATGT 1740
TATTCGACTC CTTC^GGTTA TCTTGAACTG CCTGACTTAG GCCAGCCCTA CAGCAGTGCT 1800
GTTTACTCAT TGGA3GAACA GTACCTTGGC TTGGGTCTTG ACGTGGACAG AATTAAAAAG 1860
GACCAAGAAG AGGAAGAAGA CCAAGGCCCA CCATGCCCCA GGCTCAGCAG GGAGCTGCTG 1920
GAGGTAGTAG AGCCTGAAGT CTTGCAGGAC TCACTGGATA GATGTTATTC AACTCCTTCC 1980
AGTTGTCTTG AACAGCCTGA CTCCTGCCAG CCCTATGGAA GTTCCTTTTA TGCATTGGAG 2040
GAAAAACATG TTGGCTTTTC TCTTGACGTG GGAGAAATTG AAAAGAAGGG GAAGGGGAAG 2100
AAAAGAAGGG GAAGAAGATC AAMGAAGRAA AGAAGAAGGG GAAGAAAAGA AGGGGAAGAA 2160
GATCAAAACC CACCATGCCC CAGGCTCAAC GGCGTGCTGA TGGAAGTGGA AGAGCSTGAA 2220
GTCTTACAGG ACTCACTGGA TAGATGTTAT TCGACTCCGT CAATGTACTT TGAACTACCT 2280
GACTCATTCC AGCACTACAG AAGTGTGTTT TACTCATTTG AGGAACAGCA CATOGCTTC 2340
GCCCTTTACG TGGACAATAG GTTTTTTACT TTGACGGTGA CAAGTCTCCA CCTGGTGTTC 2400
CAGATGGGAG TCATATTCCC ACAATAAGCA GCCCTTASTA AXCCGAGAGA TGTCATTCCT 2460
GCAGGCAGGA CCTATAGGCA MGTGAAGATT TGAATGAAAG TACAGTTCCA TTTGGAAGCC 2520
CAGACATAGG ATGGGTCAGT GGGCATGGCT CTATTCCTAT TCTCAAACCA TGCCAGTGGC 2580
AACCTGTGCT CAGTCTGAAG ACAATGGACC CACGTTAGGT GTGACACGTT CACATAACTG 2640
TGCAGCACAT GCCGGGAGTG ATCAGTCRGA CATTTTAATT TGAACCACGT ATCTCTGGGT 2700
AGCTACAAAA TTGCTCAGGG ATTTCATTTT GCAGGCATGT CTCTGAGCTT CTATACCTGC 2760
TCAAGGTCAK TGTCATCTTT GTGTTTAGCT CATCCAAAGG TGTTACCCTG GTTTCAATGA 2820

ACCTAACCTC ATTCTTTGTG TCTTCAGTGT TGGCTTGTTT TAGCTGATCC ATCTGTAACA 2880
GAGGAGGGAT CCTTGGCTGA GGATTGTATT TCAGAACCAC CAACTGCTCT TGACAATTGT 2940
TAACCCGCTA GRCTCCTTTG GTTAGACAAG CCACAGTCCT TCAGCCTCCA ATTGGTGTCA 3000
GTACTTAGGA AGACCACAGC TAGATGGACA AACAGCATTG GGAGGCCTTA GCCCTGCTCC 3060
TCTCPATTCC ATCCTGTAGA GAACAGGAGT CAGGAGCCGC TGGCAGGAGA CAGCATGTCA 3120
CGCAGGACTC TGCGGGTGCA GAATATGAAC AAYGCCATGT TCTTGCAGAA AACGCTTAGC 3180
CTGAGTTTCA TAGGAGGTAA TCACCAGACA ACTGCAGAAT GTRGAECACT GAGCAGGACA 3240
GCTGACCTGT CTCCTTCACA TAGTC CATF.T C^CO.CAAAT CACACAAOA. AAA.GGAGARG 3300
AGATATTTTG GGTTCAAAAA AAGTAAAAAG ATAATGTAGC TGCATTTCTT TAGTTATTTT 3360
GARCCCCAAA TATTTCCTCA TCTTTTTGTT GTTGTCATKG ATGGTGGTGA CATGGACTTG 3420
TTTATAGAGG ACAGGTCAGC TGTCTGGCTC AGTGATCTAG ATTCTGAAGT TGTCTGAAAA 3480
TGTCTTCATG ATTAAATTCA GCCTAAACGT TTTGCCGGGA ACACTGCAGA GACAATGCTG 3540
TGAGTTTCCA ACCTYAGCGC ATCTGCGGGC AGAGAAGGTC TAGTTTGTCC ATCASCATTA 3600
TCATGATATC AGGACTGGTT ACTTGGTTAA GGAGGGGTCT AGGAGATCTG TCCCTTTTAG 3660
AGACACCTTA CTTATAATGA AGTATTTGGG AGGGTGGTTT TCAAAATTAG AAATGTCCTG 3720
TATTCCHATG ATCATCCTGT AAACATTTTA TCATTTATTA ATCATCGCTG CCTGTGTCTA 3780
TTATTATATT CATATCTCTA CGCTGGAAAC TTTCTGCCTC AATGTTTACT GTGCCTTTGT 3840
TTTTGCTAGT GTGTGTTGTT GAAAAAAAAA ACATTCTCTG CCTGAGTTTT AATTTTTGTC 3900
CAAAGTTATT TTAATCTATA CAATTAAAAG CTTTTGCCTA TCA^AAAAAA AAAAAAAAAA 3960
AAAAAAAAAA AAAAAGCGGA CGCGTGGGC 3989

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3735 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

CTGCTGTTCG CTGGCTGGGC TCCGCAGCAG GCTTGGCCAG C3GCTGACGG GTCGGGGGGC 60
GGGTTTGTGT GAACAGGCAC GCAGCTGCAG ATTTTATTCT GGTAGTGCAN CCCTCTCAAA 120
GGTTGAAGGA ACTGATGTAA CAGGGATTGA AGAAC-TAGTA ATTCCAAAAA AGAAAACTTG 180
GGATAAAGTA GCCGTTCTTC AGGCACTTGC ATCCACAGTA AACAGGGATA CCACAGCTGT 240

GCCTTATGTG TTTCAAGATG ATCCTTACCT TATGCCAGCA TCATCTTTGG AATCTCGTTC 300
ATTTTTACTG GCAAAGAAAT CCGGGGAGAA TGTGGCCAAG TTTATTATTA ATTCATACCC 360
CAAATATTTT CAGAAGGACA TAGCTGAACC TCATATACCG TGTTTAATGC CTGAGTACTT 420
TGAACCTCAG ATCAAAGACA TAAGTGAAGC CGCCCTGAAG GAACGAATTG AGCTCAGAAA 480
AGTCAAAGCC TCTGTGGACA TGTTTGATCA GCTTTTGCAA GCAGGAACCA CTGTGTCTCT 540
TGAAACAAGA AATAGTCTCT TGGATTTWTT GTGTTACTAT GGTGACCAGG AGCCCTCAAC 600
TGATTACCAT TTTCAACAAA CTGGACAGTC AGAAGCATTG GAAGAGGAAA ATGATGAGAC 660
ATCTAGGACG AAAGCTGGTC ATCAGTTTGG AGTTACATGG CGAGCAAAAA ACAACGCTGA 720
GAGAATCTTT TCTCTAATGC CAGAGAAAAA TGAACATTCC TATTGCACAA TGATCCGAGG 780
AATGGTGAAG CACCGAGCTT ATGAGCAGGC ATTAAACTTG TACACTGAGT TACTAAACAA 840
CAGACTCCAT GCTGATGTAT ACACATTTAA TGCATTGATT GAAGCAACAG TATGTGCGAT 900
AAATGAGAAA TTTGAGGAAA AATGGAGTAA AATACTGGAG CTGCTAAGAC ACATGGTTGC 960
ACAGAAGGTG AAACCAAATC TTCAGACTTT TAATACCATT CTGAAATGTC TCCGAAGATT 1020
TCATGTGTTT GCAAGATCGC CAGCCTTACA GGTTTTACGT GAAATGAAAG CCATTGGAAT 1080
AGAACCCTCG CTTGCAACAT ATCACCATAT TATTCGCCTG TTTGATCAAC CTGGAGACCC 1140
TTTAAAGAGA TCATCCTTCA TCATTTATGA TATAATGAAT GAATTAATGG GAAAGAGATT 1200
TTCTCCAAAG GACCCGGATG ATGATAAGTT TTTTCAGTCA GCCATGAGCA TATGCTCATC 1260
TCTCAGAG^T CTAGAACTTG CCTACCAAGT ACATGGCCTT TTAAAAACCG GAGACAACTG 1320
GAAATTCATT GGACCTGATC AACATCGTAA TTTCTATTAT TCCAAGTTCT TCGATTTGAT 1380
TTGTCTAATG GAACAAATTG ATGTTACCTT GAAGTGGTAT GAGGACCTGA TACCTTCAGC 1440
CTACTTTCCC CACTCCCAAA CAATGATACA TCTTCTCCAA GCATTGGATG TGGCCAATCG 1500
GCTAGAAGTG ATTCCTAAAA TTTGGAAAGA TAGTAAAGAA TATGGTCATA CTTTCCGCAG 1560
TGACCTGAGA GAAGAGATCC TGATGCTCAT GGCAAGGGAC AAGCACCCAC CAGAGCTTCA 1620
GGTGGCATTT GCTGACTGTG CTGCTGATAT CAAATCTGCG TATGAAAGCC AACCCATCAG 1680
ACAGACTGCT CAGGATTGGC CAGCCACCTC TCTCAACTGT ATAGCTATCC TCTTTTTAAG 1740
GGCTGGGAGA ACTCAGGAAG CCTGGAAAAT GTTGGGGCTT TTCAGGAACC ATAATAAGAT 1800
TCCTAGAAGT GAGTTGCTGA ATGAGCTTAT GGACAGTGCA AAAGTGTCTA ACAGCCCTTC 1860
CCAGGCCATT GAAGTAGTAG AGCTGGCAAG TGCCTTCAGC TTACCTATTT GTGAGGGCCT 1920
CACCCAGAGA GTAATGAGTG ATTTTGCAAT CAACCAGGAA CAAAAGGAAG CCCTAAGTAA 1980
TCTAACTGCA TTGACCAGTG ACAGTGATAC TGACAGCAGC AGTGACAGCG ACAGTGACAC 2040

CAGTGAAGGC AAATGAAAGT GGAGATTCAG GAGCAGCAAT GGTCTCACCA TAGCTGCTGG 2100
AATCACACCT GACAACTGAG ATATACCAAT ATTTAACATT GTTACAAAGA AGAAAAGATA 2160
CAGATTTGGT GAATTTGTTA CTGTGAGGTA CAGTCAGTAC ACAGCTGACT TATGTAGATT 2220
TAAGCTGCTA ATATGCTACT TAACCATCTA TTAATGCACC ATTAAAGGCT TAGCATTTAA 2280
GTAGCAACAT TGCGGTTTTC AGACACATGG TCAGGTCCAT GGCTCTTGTC ATCAGGATAA 2340
GCCTGCACAC CTAGAGTGTC GGTGAGCTGA CCTCACGATG CTGTCCTCGT GCGATTGCCC 2400
TCTCCTGCTG CTGGACTTCT GCCTTTGTTG GCCTGATQTG GTGCTGTGAT GCTGGTCCTT 2460
CATCTTAGGT GTTCATGCAG TTCTAACACA GTTGGGGTTG GGTGAATAGT TTCCCAATTT 2520
CAGGATATTT CGATGTCAGA AATAACGCAT CTTAGGAATG ACTAAACAAG ATAATGGCAG 2580
TTTAGGCTGC ACAACTGGTA AAATGACTGT AGATAAATGT TGTAATTAGT GTACACGTTT 2640
GTATTTTTGT TAATATAGCC GCTGCCATAG TTTTCTAACT TGAACAGCCA TGAATGTTTC 2700
ATGTCTCCCT TTTTTTTTTG TCTATAGCTG TTACCTATTT TAGTGGTTGA AATGAGAGCT 2760
AGTGATGACA GAAGGATGTG GAATGTCTTC TTGACATCAT TGTGTATTGC TGGTAATCAA 2820
GTTGGTAACG ACTACTTCTA GCAGCTCTTA CCACTATGAC TTAAGTGGTC CTGGAAGGCA 2880
GTAAGTGGAG GTTTGCAGCA TTCCTGCCTT CATGAGGGCT TCTACCACTG ACCACTTTGC 2940
ACGTACCTGG CTCCCAGATT TACTTAGGTA CCCGACGAGT CGTCCACATA AGCAGCTTCA 3000
TCTTTACCTT GCCAGAGTTG ACAATTATGG GATACTCTAG TCTACTTATA CTTGTGTTCC 3060
CATCTCTCTG CCATCCTCTG AAGGCCAGGA CCCAGTCATA CATCCTTAGA AACCAAAGTA 3120
TGGTTTTTGT XTTCTCTTGG AATGTCAGGT CTTAAGGCAT TTAATTGAGG GACAAAAAAA 3180
AAAAAAAGCC GATATAGTAG CTAGCTACTT AAGCATCCAT GGGTATTGCT CCATATCAAA 3240
GCAGATTTGC AGGACAGAAA GAGTAAATTA GCCTTCAGTC TTGCTTTACA GCTTCCAAGG 3300
AGAGCCTTGG CCACCTGAAA TGTTAACTCG GTCCCTTCCT GTCTCTAGTT CATCAGCACC 3360
TGCAGATGCC TGACTCTTGT TAGCCTTACT ATTCAATACA GTCCTTAGAT TCACGGTATG 3420
OCTCTTCCTA TCCAGGCACC TATTCTGAAT CACCATCTTG CTCTGCAGC7 AGAGTTGATA 3480
GGAGAAAATC CATTTGGGTA GATGGCCTAT GAATTTGTAG TAGACTTTCA AAATGAGTGA 3540
TTTGTTAGCT TGGTACTTTT AAGTTTGTGG 7ACAGATCCT CCAAACCCAT ACTCTGAGO. 3600
ATTAACTGCC TTGAACATAG AGAAAATTAA GGCCTCACAG GATGAGTCTC CATTCTCTGT 3660
AAATGCTTAT TTTATCATAG TCTTTAGCCN CTACTATGAG TAAAATGTTC TCTTCMGCCG 3720
GGTGTGGTGA CTCAC 3735



WO 98/54963 



290 



PCT/US98/11422 



(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1667 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

10 

TAGTAATTCA TTTAACTCCT CTTACATGAG TAGCGACAAT GAGTCAGATA TCGAAGATGA 60
AGACTTAAAG TTAGAGCTGC GACGACTACG AGATAAACAT CTCAAAGAGA TTCAGGACCT 120
GCAGAGTCGC CAGAAGCATG AAATTGAATC TTTGTATACC AAACTGGGCA AGGTGCCCCC 180
TGCTGTTATT ATTCCCCCAG CTGCTCCCCT TTCAGGGAGA AGACGACGAC CCACTAAAAG 240
CAAAGGCAGC AAATCTAGTC GAAGCAGTTC CTTGGGGAAT AAAAGCCCCC AGCTTTCAGG 300
TAACCTGTCT GGTCAGAGTG CAGCTTCAGT CTTGCACCCC CAGCAGACCC TCCACCCTCC 360
TGGCAAGATC CCAGAGTCOG GGCAGAATCA GCTGTTACAG CCCCTTAAGC CATCTCCCTC 420
CAGTGACAAC CTCTATTCAG CCTTCACCAG TGATGGTGCC ATTTCAGTAC CAAGCCTTTC 480
TGCTCCAGGT CAAGGAACCA GCAGCACAAA CACTGTTGGG GCAACAGTGA ACAGCCAAGC 540
CGCCCAAGCT CAGCCTCCTG CCATGACGTC CAGCAGGAAG GGCACATTCA CAGATGACTT 600
GCACAAGTTG GTAGACAATT GGGCCCGAGA TGCCATGAAT CTCTCAGGCA GGAGAGGAAG 660
CAAAGGGCAC ATGAATTATG AGGGCCCTGG AATGGCAAGG AAGTTCTCTG CACCTGGGCA 720
ACTGTGCATC TCCATGACCT CGAACCTGGG TGGCTCTGCC CCCATCTCTG CAGCATCAGC 780
TACCTCTCTA GGTCACTTCA CCAAGTCTAT GTGCCCCCCA CAGCAGTATG GCTTTCCAGC 840
TACCCCATTT GGCGCTCAAT GGAGTGGGAC GGGTGGCCCA GCAGCACAGC CACTTGGCCA 900
GTTCCAACCT GTGGGAACTG CCTCCTTCCA GAATTTCAAC ATCAGCAATT TGCAGAAATC 960
CATCAGCAAC CCCCCAGGCT CCAAGCTGCG GACCACTTAG ACCTAGAGAC ATTAACTGAA 1020
TAGATCTGGG GGCAGGAGAT GGAATGCTGA GGGGGTGGGT GGGGGTGGGA AGTAGCCTAT 1080
ATACTAACTA CTAGTGCTGC ATTTAACTGG TTATTTCTTG CCAGAGGGGA ATGTTTTTAA 1140
TACTGCATTG AGCCCTCAGA ATGGAGAGTC TCCCCCGCTC CAGTTATTGG AATGGGAGAG 1200
GAAGGAAAGA ACAGCTTTTT TGTCAAGGGG CAGCTTCAGA CCATGCTTTC CTGTTTATCT 1260
ATACTCAGTA ATGAGGATGA GGGCTAGGAA AGTCTTGTTC ATAAGGAAGC TGGAGAACTC 1320
AATGTAAAAT CAAACCCATC TGTAATTTCG AGTGGGTOGA GCTCTTGCTT TTGGTACATG 1380
CCCTGAATCC CTCACTCCCT CAAGAATCCG AACCACAGGA CWAACGC CTACTGGGCT 1440
CTCTCCTACC CIGCCCTCCT CCCTTTTTTT TACCCCTCTC TTTTTTATTT TTTCTTTGCT 1500
CTTTAGAACC C-X}TGAAAAA TACCAGGGTA CTGGGGTGCA ACTCTTTCTT ATGATAGGTC 1560
ATTAGTGCTT TAACCAAAAG ATATTAGCAG CTTTGACTGC AGCATTAGCA ATTAGGRAAA 1620
AAAA&AANWft. AAAACTCGAG GGGGGGCCCG ^^^C-EaaT ~TCC<:C'GT* " To 67 ~ 



(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1403 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

ATTACACACC TGAGCACTGT GCCTGGCAAG ACCTGTCTTA ATAGATTAGA GAACCACTGA 60
TAGATGGTCA GCTTTCTGTA GCACTGAGAA CCCTACATTT CAAATGTGGA TAGCACCTTT 120
GCGGGGAAAC ATCACTTGGC ACATCTGCAT TCTTTTTTGA CA2AGGGTCT CACTCTGTTG 180
CCCAGGCTAG AGTGCATGGC ACGATCTTAG CTCACTGCAA CCTCCACCTC CCAAGTTCAA 240
GCGATTCTTC TGCCTCAGCC TCCTGAGCAG CTGGGATCAC AGACATGCGC TACCATGCCC 300
AGCTAATTTT TTGTATTTTT TGTKTGTTTG TTTTTGTTTK TAAGTAGAGA CGGGCTTTCA 360
CCACGTTGGS CAGGCAGGTC TCGAACTCCT GAMCTCAGGT GATCCACCCA CATCTGCGTT 420
CCAATATCTT TCTCAACATA ATGATAGCCG TAATTAATAT TTTCCAGTAC ATTTTTATGC 480
CTTTACACAC GAGAGTGGTA GACAGACACA AACCCAGATC TGTCTGACTC CAAAGCCCGT 540
TFGTCATCAT TCCTTTTACG GTATCCTATA GTGGTATCCT TTAOGAAAG ACAGCTTTTA 600
CCCAACAAAG ACTTAACTTC CCAGGATGCC AGAAGGACAA AGCGGGATTG CTTTTAAGRA 660
GRAAGTTATC AAGAMCTTAT TTTATAAATG AGATTAGATA GGGAAAGGCA ATTTATCTTT 720
ATTAAAAACT GAAAAGGCCA GCATAGGGAA GGAGGTGCTT CGGTGGTCTT TTTCAGGGAA 780
ATACTTCAGT TGCTTTTATT AGAAACAGAT AGTACCTAAG GTTTTGAGGT AGGWACAGCT 840
TAAGGCATGC TAATGKTCAT GGGTCCTTCC ATAGTCATTT TKGTATTTTG GTTWACATTT 900
GAGCAATAGG CAGCCCTTCA CTGCTGCTGG AYTCATTCCT GCCAYTATTA CAGGTGACAG 960
AGGAGACAGG AGGTATGTCT TTTCTATTTT TAWACATGCT TTATATTTAA CACAAGCTCT 1020
TGGGTATCTT AGATAAACAG AAGTTGCCTA GCACTCCTTT TAGTGCATTG AACCCTTTAA 1080
CATTTAAGCA AAATAATAAA CAGTCTTTTG AGGTTCCTTA ACAATGAAAC GTGTTCGAGT 1140
GGCAGCAGCG GAATCCATGC YTCTTCTCCT GGAGTGTGCA AKAGTCCGTG GTCCTGAGTA 1200
TCTCACACAG ATGTGCCATT TTATGTGTGA TGCTCTAATT AAGGCCATTG GTACAGAACC 1260
^ATTCAGAC GTCCTC7CAG AAATAATGCA TTCTTTTGCA AAGGTGAATA TTTTTCTCTT 1320
ArAWAT.-rS TATAAGGTGG TATGTTCvTT TATTAGTCTT CCTAAAAAAA AAAAAAAAAA 1380
iJCTOSGrtSGG GGGGECZSGT ACCCAAT? 1403

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2031 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

20 

AJGGATA.TGCA TGATTCOTAA CCAGGCTATA TGTTAAAAAA AAATTGGAAA ATGCAATAGA 60
TTTTTTA-CTA TACAAACTAC AGAATGAGTA TGCAAGTTTT ATTTATCAAA ATGTAATGGA 120
rtTTT AAAgG CTGAGAAATT TTCCTTATAC t CT ACCTTTTC AGTTATTTTA ATTATACCAA 180
ATTATCAACT AGAATAGGTT CATCCATAZG AAATATAAAA TGAAGAGACA CCTAGGCTCT 240
ATCAGGCTTA GGATTCTTTG Ar.CTTATTTC CACTTTAATT TCTCAGTGGA AGTTAAGAGG 300
GGTGAGAAAA CAAAGAAGGG GAA-AACTGA CAACTAACAA AACCAGCACC ACATCGCTAG 360
GTGGTC-CrTA CTAATTACCT TCTCAGGATT TTCCTCAGAT TGAAAAGCTT ATGAGGATTT 420
CTTGGGAGOC TTAATAACCT GCCTGTTAGT ACAGAGCTTT CCTGATGATA TTTACTCTTG 480
AGCACATGTG GT7TGTAAAAC CTTAACTTTC TTTCTCCA3G AGGGTGGTGA TAGAAACAGA 540
TGGTAGTATT TATGAACTGA TGTTCTCGTG AAATGTTGAG GGTGGGGAGA ^AAJ^AOTTTA 600
AGGGAGGAGA GCCATCTATT TTGTTCCTAA AGCCACCTCT CAGCAGAATC GTCATGTTTT 660
TCTGATGCAC CGCTCTGCTT CATGCCCAAG ATGACTTGCG AGGCAATCTC AGGAGCTGTG 720
GACTTAACC?. TTGCAPAGCA CCTGTCTTT CTCAGCGTTC TCTGCAAGTC A3TAGGTGTT 780
AGTATGGTTG CAAAGTTCAC TGTCTCA3CA AAGTTGAACT GGGCTACCTC TCTACACCTG 840
TTTCCTCAGA GGGAAAAATC TTGPiGACCAG ATGGTGGAGC TCTGGAGTCA GAGGA^ATGG 900
GTGTCTTCAG CACAAAGCTG CTGCTTTTAC TTCAGCCACT TCTGACATTT TTACATACCG 960
AGCCTGAGAT TKTGTGATTA TCTCAAATCA AATCACTTTG ATGGAGATAA ATAATCAAAA 1020
CIGTTT2ATA GTCATTGATT TGGTGAGAAC AGTAATGGAA AATGGTGTTG AAGGACTTCT 1080
CATTTTXGGA GCTTTCCTTC C-JGAGTCCTG GCTGATTGGT GTTCGCTGTT CATCTGAGCC 1140
CCCAAAAGCA TTATTACTGA TACTTGCACA C\GTCAAAAG CGCAGACTGG ATGGATGGTC 1200
TTTTATAAGG CATTTAAGGC TACACTACTG TGTTTCACTG ACCATACATT TTTCTTAGCC 1260
CCTCAAGTAA TATAGCACAG AGTTATGAAT GACAATTCOG CTAACCATTC GTGTTCATAT 1320
GTGCCTCTTC CCCTTACCAT CGTAATTCTC CAAACTGGTC ATAAAGGCAC TCTGTGAAGA 1380
TATTGGGGAC TGAC^TCTTA AGCTCTCACC TGGCTGCAGT AGGAAAGGCC AAACTGACGA 1440
CAAAAAAAAA ATTCTTTATA AAGATGATAT GGTAACATGT ATCTTTGCCC TGGGTCTGGG 1500
TGGGTCCAGT CAGTCTCAGA TTTACAAGCA TTTAGGAGCC TAGGTAAAAG CTGCTAGTAT 1560
TCTTTTAAAA GTTACATTTA TGACTTGCAA TGATAGAAAA CTCCTTCCAA TTAAATGGCA 1620
TTTTATAATA TTATGTGTGT ACTTGACAGT GTTAAAAATA CCCTCATACG TTATTGCATT 1680
TGATCTTCAC AGAAAGTGCA TTTTAACCAG TACTCTGGGT GCAATAAATA ATATGTAGAA 1740
ATTTAACTCC TCCAATTCCA GCATATCCAG TGAGTTTTGA CAGTGTGTTT ATGTGGAATG 1800
TTTAftGGATA TACAATTGTA CTTTATATAA ATTGGTTGTT GTTCTTCTTA AATGTGACAT 1860
GAAATAATTG TGCTGCTACA TTATACTGGA AATTAACACG GGAAAAGGGA AGAGGTCTTG 1920
GCTCCCTTGA GGTTCTGCTA GTGGTGTTAG GAGTGGTTAC AACTGAGCTT TTAGTAACCA 1980
TTTAACCGTA TGTAAACTTG GTTTCTAATT AAAAAAAAAT TTCTTTTTCC A 2031

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 971 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

CGCGTCGGAA CTGGGCCGCG GCACATCCAC GGGGCGCGAG TGACACGCGG GAGGGAGAGC 60
AGTGTTCTGC TGGAGCCGAT GCCAAAAACC ATGCATTTCT TATTCAGATT CATTGTTTTC 120
TTTTATCTGT GGGGCCTTTT TACTGCTCAG AGACAAAAGA AAGAGGAGAG CACCGAAGAA 180
GTGAAAATAG AAGTTTTGCA TCGTCCAGAA AACTGCTCTA AGACAAGCAA GAAGGGAGAC 240
CTACTAAATG CCCATTATGA CGGCTACCTG GCTAAAGACG GCTCGAAATT CTACTGCAGC 300
CGGACACAAA ATGAAGGCCA CCCCAAATGG TTTGTTCTTG GTGTTGGGCA AGTCATAAAA 360
GGCCTAGACA TTGCTATGAC AGATATGTGC CCTGGAGAAA AGCGAAAAGT AGTTATACCC 420
CCTTCATTTG CATACGGAAA GGAAGGCTAT GCAGAAGGCA AGATTCCACC GGATGCTACA 480
TTGATTTTTG AGATTGAACT TTATGCTGTG ACCAAAGGAC CACGGAGCAT TGAGACATTT 540
AAACAAATAG ACATGGACAA TGACAGGCAG CTCTCTAAAG CCGAGATAAA CCTCTACTTG 600
CAAAGGGAAT TTGAAAAAGA TGAGAAGCCA CGTGACAAGT CATATCAGGA TGCAGTTTTA
GAAGATATTT TTAAGAAGAA TGACCATGAT GGTGATGGCT TCATTTCTCC CAAGGAATAC 
AATGTATACG AACACGATGA ACTATAGCAT ATTTGTATTT CTACTTTTTT TTTTTAGCTA 
TTTACTGTAC TTTATGTATA AAACAAAGTC ACTTTTCTCC AMJTTGTA.TT TGCTATTTTT 
CCCCTATGAG AAGATATTTT GATCTCCCCA ATACATTGAT TTTGGTATAA TAAATGTGAG 
GCTGTTTTGC AAACTTAAAA AAAAAWWAAA AAAACTSGAG GGGGGCCCGT ACCCAANTCG
CCGNATATGA T 

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1792 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
GAACCCCCTT TCTCCTGGTA AAGGGTAAGG GGGGGGATAA TGTTTACCAC AGGTACGAAA
TAGTCACTTT AACATTGAGA CCTCTGCCTC ATTGAATTCA GGTTTTTTAA GTACTTGAAA
CTCTTCAGAT TCTCCTTATT TTAGTTTCTT TTTACATTTA TGAAGTAGAA AGCATTGTTT 
TGTAAACTGT TTTGAAAATA AATAGCCTAG TCTCTTATCC TCTTTAGCGT GGATTAAAGG 
TGAAGTTCTG CAAATGGGAG AGTGTTCACA GTAGATAGCT CAQATTGATT GAACACATTT 
GAGGAAGAGA CTCCTGCATG AGATACCAGC ATTTTTACAA ATACTTTTTA TGTACATTCT
TTATTTTGTC ATTTTGTCAA CCCTCTCCCC AAGCACATCT TCTTTCCTTT TACTATGTCT 
ATGTAGGGAA AAACAAAACA AAAAATTGCA CTTACGTTAC ACTCCCAAAA TGTGGGTAAT 
CCGTGTCTTT CAAAAAACAT TTCTGTTTTT TGTTTTGTTT TGGTCAGTCC ATTGCATAAG
TGACAAGTTT GGGTGCTTGT GGCACGTATG TATGAAGCGG GAGGGGGATG ASAATTGCCT 
GTCCTTCAGT ARGCTGTAAA AGTAATTTAC ATGTAAGTAA AAAGGGAAAA TAGAATAGAT
GCCAAAGTCA TTTATTCAGT CCTTAGTTTT CTTATGTGGC ATTACTGCAT CTGCTAGTTA 
GTGAGAAAGC ACCCTCAGCT TTTACTGCTC CCCTCCCTGC CTGCCAACAC ACTTGATGTG 
TGCAAACAGC CCTCAAGTAT CTGTCAGATG ACCTATATAA GGTATTGAAT AAGGTATTCT 
TGTCAGTTTA GAAATGGACT GGATAAAACT TACTTGGTTG TCATTATTTT ATCTCATTTG 
TCCTGTTACA TGCCCTATGT TAAGATAATT ATATTGCCAC TAATAATCAA GATGCTAAAT
GAGTATTACA ACTGGCTAAT ATCATTTTTT ATATACAAGG GTATGTGTAT ATTTGGAATT 1020
GRTATGAGAA ACTCATTTGT ACCCATTTGA GTGATATTGC ACAACAAAGA CAGATAYCTA 1080
CAGACTCCGT TTTCATTTTC TCGTGTTCTT TATGATAATG ATCTTTGTAG ATTGGTTATT 1140
TCTGTACTTT ATCTGTAATA AACTTTGTAG ATCCTGTGAA CCATTACTTT GCCTAAATCA 1200
CT^GAGACTT GAGTCTTTAA TAACAAAGCA TCAATATTCA CTAAAGTCAA TCTCTTTTGA 1260
GTTTCTGTGA CTTGGCTAGA AGCTCTTGAC ACTAAGGGAT TAGTGTTAAT TTTCCCTGGG 1320
GGTGTTCCAC TAGGGCATTA CTGTATAATG ACTTGATGTT GGCACATAGA CTTCAAGATA 1380
TATAATATTT TGAGGATTTT GTTGATTGGC CTATGTTTTA TTGCATAGTG TGAAACGTGT 1440
AAAGCTTGGT TAACCTGTAT ATAGATAGGT TATTGTTGAC TAGTTATAGT GTATTTAGGG 1500
TTGCCTGTAA TATTTAAGGT TCTTTACTGA TGTGTGTGGT GGTAGGAACA TATAATTTTT 1560
GTACATTATA TTTACTGAGA TGTTGGCTTT TTTATTTTAC AAATACTTTG GAATTCCAAT 1620
GTGTTTTTTG CTTCCGTGAG GATTAATTTG GAAAGGTTTT TAATGACATT CCACTGATTT 1680
CAGATTTTGC TTGAGATTGA GTTCAATAAA TTGTCCTGTA TGTTCCAAAA AAAAATTAAA 1740
AAACTCGAGG GGGGCCCGGT ACCCAAMNCG (tCGGATATGA TCGTAAACAA TC 1792

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 896 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

AGTTGNANAC AACAGGACCT GAGTCCTTGG GCAGCACCAG TAGGTTGCCC CYTGCYTCYT 60
GCCAGCYTCA CYTGCCACYT TYTGCCCCTY TCGGGATGCC TTCGCAGACA GAGYTYTTCG 120
CTGCCTGTGG TGGCCAYTCT TTGCTTTTGG TTYTCTTGCC CCTTGGCCTC CCTTTTTGTC 180
CCCGGGCAGC CTTGTGTGAC CTGCCCTTTT CCCTCCCTTC CTTTCCAGGA CAAGCACGCC 240
GAGGAGGTGC GGAAAAACAA GGAGCTGAAG GAAGAGGCCT CCAGGTAAAG CCTAGAGGCC 300
AAAGAACTTT CCAGGTCAGC CGGACAGCTC CAGCAGCTCC ACGTTCCAGG CAGCCTCGMC 360
CGCCGGCTGC GCTCCCAGCA CTGGGGTTTG GGGGGAGGGG GGTGGCCAAG GGGCGTTTCC 420
TCTGCTTTTG GTGTTTGTAC ATGTTAAGAA TTGACCAGTG AAGCCATCCT ATTTGTTTCC 480
GGGGAACAAT GACGGGGTCG GARAGGGGAG AGGAGAGAGT TTGGGAAAGG GAGATGGAGA 540
AGAACTCAAG GACATTGCAA CCCTGCCCGG CGCAGATCTG ATTTTCACAT CTCTACCTGG 600
ACATTGAGCC TCCCAGGCAC CATGTTGAGG AGAGATGAAA ACCAGGGCGG TAGAACTTCA 660
GGGTGAAGGA CAGAGTCGTG GGTGGGGCAG CGGCTGGAGG GCGCACCAGA GAACCCAGCC 720
AGAGGGGGTG TGAGTACCAG TGGTGTTGCT TCCACCCTGC AGCA3GTGGG ATGAGGTCTG 780
TGTGTGTGTG TCAACCATCA TTTTTTGATC ATCATGACCA ATGAAACATT GArAAAAAAA 840
AAAAAAACTG GAGGGGGGCC CGTACCCAAN TCGCCGNATA GTGATGGTAA ACAATC 896



(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 912 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

TCGACCCACG CGTCCGGTCA GCCAGTCGCA TCCAGCCATG ACAGCCTTCT GCTCCCTGCT 60
CCTGCAAGCG CAGAGCCTCC TACCCAGGAC CATGGCAGCC CCCCAGGACA GCCTCAGACC 120
AGGGGAGGAA GACGAAGGGA TGCAGCTGCT ACAGACAAAG GACTCCATGG CCAAGGGAGC 180
TAGGCCCGGG GCCAKCCGCG GCAGGGCTCG CTGGGGTCTG GCCTACACGC TGCTGCACAA 240
CCCAACCCTG CAGGTCTTCC GCAAGACGGC CCTGTTGGGT GCCAATGGTG GCCAGCCCTG 300
ARGGCAGGGA AKGTCAACCC ACCTGCCCAT CTGTGCTGAG GCATGTTCCT GCCTACCATC 360
CTCCTCCCTC CCCGGCTCTC CTGCCAGCAT CACACCAGCC ATGCAGCCAG CAGGTCCTCC 420
GGATCACYGT GGTTKGGTGG AGGTCTGTCT GCACTGGGAG CCTCARGARG GCTCTGCTCC 480

ACCCACTTGG CTATGGGAGA GC<^GCZGGG GTTCTGGAGA AAAAAACTGG TGGGTTAGGG 540 

CCTTGGTCCA GGAGCCAGTT GAGCCAGGGC AGCCACATCC AGGCGTCTCC CTACCCTGGC 600 

TCTGCCATCA GCCTTGAAGG GCCTCGATGA AGCCTTCTCT GGAACCACTC CAGCCCAGCT 660

CCACCTCAGC CTTGGCCTTC ACGCTGTGGA AGCAGCCAAG GCACTTCCTC ACCCCYTCAG 720

CGCCACGGAC CTYTYTGGGG AGTGGCCGGA AAGCTCCC3G GCCTYTGGCC TGCAGGGCAG 780

CCCAAGTCAT GACTCAGACC AGGTCCCACA CTGAGCTGCC CACACTCGAG AGCCAGATAT 840

TTTTGTAGTT TTTATKCCTT TGGCTATTAT GAAAGAGGTT AGTGTGTTCC CTGCAATAAA 900

CTTGTTCCTG AG 912



(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1332 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

10 AATTCGGCAC GAGCGGAGOC GAGGGAAACT RAGGGCGAAA GTTGTGTGTC GTGTTGGCAG ■ SO 

GAGGGCCTAG AAGGGAAAGA CTGTCTAGTG GGACAATGTC ATATTA.TAAA TTTGGAATGC 120 

TGAATAGAAA. ATTATAGATT TTGATATTGA AGGAAATGAA GCGAAGCYTA AA.TGAAAATT 130 


CAGCTC'GAAG TACAGCAGGC TGTTTGCCTG TTCCGTTGTT CAATCAGAAA AAGAGGAA.CA 240 

GACAGCCATT -AACTTCTAAT CCACTTAAAG ATGATTCACG TATCAGTACC CCTTCTGACA 300 

20 ATTATGATTT .TCCTCCTCTA CCTACAGATT GGGCCTGGGA AGCTGTGAAT CCAGAGTTXG. 360 

CTCCTGTAAT G AAAACAGT G GACACCGGGC AAATACCAGA TTCAGTTTCT CGTCCTCTGA 420 

GAAGTCAAGA TTCTGTCTTT- AACTCTATTC AATCAAATAC TGGAAGAAGC CAGGGTGGTT 430 


GGAGCTACAG AGATGGTAAC AAAAATACC A GCTTC-AAAAC TTGGRATAAA AA.TGATTTTA 540 

AGCCTCAATG TAAACGAACA AACTTAGTGG CAAATGATGG AAAAAATTCT TGTCCAATGA 600 

30 GTTCGGGAGC TGAACAAC^A AAACAATTAA GAACACCTGA ACCTCCTAAC TTATCTCGCA 660 

AGAAAGAAAC CGAGCTACTC AGACAAACA.C ATTCATCAAA AATATCTGGC TGCACAATGA 720 

GAGGGCTAGA CAAAAACAGT GCACTACAGA CACTTAAGCC CAATTTTCAA CAAAATCAAT 730 


ATAAGAMACA AATGTTGGAT GATATTCGAG AAGACAACAC CCTGAA.GGAA ACCTCATTGT '340 


ATCAGTTACA GTTTAAGGAA AAAGCTAGTT CTTTAAGAAT TATTTCTGCA GTTATTGAAA 900 

40 GCATGAAGTA TTGGCGTGAA CATGCAC-.GA AAACTGTACT TCTTTTTGAA GTATTAGCTG 960 
TTCTTGATTC AGCTGTTACA GCTGGCCCAT ATTATTCGAA GACTTTTCTT ATGAGGGATG ' 1Q2Q 

GGAAAAATAC TCTGCCTTGT GTCTTTTATG AAATCGATGG TGAACTTCCG AGACTGATTA 1030 


GAGGCCGAGT TCATAGATGT GTTGGCAACT ATGACCAGAa' AAAGXACAT7 TTCC^ATGTG 1140 

TTTCTGTCAG ACCGGCGTCT GTTTCTGAGC AAAAAACTTT CCAGGCATTT GTCAAAATTG 1200 

50 CAGATGTTGA GATGCAGTAT TATATTAATG TGATGAATGA AACTTAAGTA GTGATAAAAG 1260 

GAAGTTTAGC ATAAATTATA GCAGTTTTCT GTTATTGCTT AATTTACCAT CTCCATAGTT 1320 

TTATAGCTAC TATTGTATTT CACTTGTTGA ATTAAAGTAT TTGAATTCTT TTAAAAAAAA 1330 


AA 1332 



(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 872 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:
GGGCTACTTC AAAGCCCTGG GCCTTATTT C TTCAGGTAAA AAAATATAAA GTCAGATCTC 
ATCCCGGCTG GCCATGCTGT TAGACCCTTT CATCCTTCTC TTCTGCCTCT TCTCAACAGC 
TGCCCAGTCC " TGTTTGGAAT TCATATACAT ACAGTTCTAA TACTGATGTA TTTACCCTCA 
TAAGCCACTC AACCCAGAAT CTTATTTGAA TTATAATCCA GAAACATCAG GTGACGTGTG 
AGACTACTGT ATGAGAAAGA GACAGTTTAA GGGTCAGTCC AATGGAAAAA AGAGTTCTCA 
GAGCTTTCTT TAGCTTATTC TCATCAAAGA GCTTTCTCTG CAGAAGGAAC CTACTGGTTC 
CTCCTTTCCA GTCCTAGAAA TCCTGACCTA GAGTGGCTTA ATCCTGCTAG CACCTCTCTC 
TCGCACTCTG GTGCCAAATG ACTCCAGGAA CTGGGCCATG ATGTGGTGGG AATGACCTTA 
CCCTGAGCAT GTCACTCATG CATTGAAGAA ' CAGCTAAGAG CAGAGCTTAG AGCTTAGAGC 
TGGGCCCTGT AAGGTGAGAG GAATCACATC CTGCAGAAGT CTGTCCTGAG AAGCAGGTAC 
f CCTGTCACA GCAGAGACAC AGTGGATACC TGAGTAACAA TAATACAAGA CAGGACGTGG 
GMACAGCAAA AGATTTGGGT GTCAGAAGAR GCCGAGAACA CTTYCA&GCA GGAACATTCA 
RARTTGTTCT TGGAGGAART AGGCMC3AAG GCTGGGCAGG ATTTCMCGGG GCAGAGA.TGG 
AGCAAGCAAT TGAAATGAAA GCCATGGCAT GGGAAAAGGA GCACTGGCCA CAGGGAGTGC 
AACGTTGTGA TGCAAGGCCA CTGTGGAGCC AT 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 812 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

GGCAGAGGCT CACCCCAGCA GAGATTGAGG GGGAACCGTG ATGAAATTTT TAAGTATTCT 
GCTTGATGAT AATAATTTTY CTCTTATGTT AATGTTGGCT CCGTTTGGGT GTTTAGCTTT 
TGAAAGGAGT ATGAAAATGC GGAATGGGGC TTTGGGGCTT GAGGAGGTGT GATCTCTAGT 
GTTTAAAAAA TTTAATTGCA CAAATAGAAA TAATTCACCC ACATTATTGA ACCCCACTAA
AGCATATCCT TTTTGTCCAT ATTCCTTTCC TGCTGCCCTC GTGTGTACCA TTATTACTCA 300

GTTGTGA.TTT GAGCTCGTTC CACTTAAAGT CATTCATAGA TACTTTTGCG TCGTGTTKGA 360 


ATATTTATTG AATTTCTATT CTGTGTTTTA CTTAATTACT TTATTATGGA ACCTTTACAC 420 

AGGTCTGGTG TACTTGTTCT TTGAAAAGTC TTATGTTGAC CACCATCACT GAGCATATAG 480 

10 CTTTTTCCTT ATTTCCTTGG GATAATTACC CGAAGTGGAA ATACCGAATC AAACTTCTGT 540 

TTTCTTTCTT TGGCACTATT ATATAAATTG TTTTCCAAAC AAGGCATGTT TACAATAGAC 600 

ATTTTTCAAA ATCTGGGTAT TTGTCCTATT TTGCTCTCTG TATGCAGAAT TCAGCGGGGT 660

GCCAAGTCGT TTTCTGTGTG GGTTGAGAGA CAGGCTGTGC AGCCCACTGT TGCATAGGAC 720 

TAACTACTAC AAATCATGCT GAGACCGAGC TATTTTTGCT GCTTAGARGC TTTGCAGCCT 780 

TGAGTAAGTT TCGNCATCTG GAAACNTTGN AA 812



(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1515 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

35 AATTCGGCAC GAGGGAAATT CAAGCACTTT TCCTAAAAGA AGGGGGAATG GATGCTGAAA 60 

CAACACGTNT C C 0 AC AAAGG GAGCAGACAC -TGGGCTTGTG AAGCTGCCCC ATACCTTCCC 120 



CACAGAACTG GGGTCCGGCC TCCCTGACAT GCAGATTTCC ACCCAGAAGA CA.GAGAAGGA 180 


GCCAGTGGTC ATGGAATGGG CTGGGGTCAA AGACTGGGTG CCTGGGAGCT GAGGCAGCCA 240 



CCGTTTCAGC CTGGCCAGCC CTCTGGACCC CGAGGTTGGA CCCTACTGTG ACACACCTAC 300 



45 CATGCGGACA CTCTTCAACC TCCTCTGGCT TGCCCTGGCC TGCAGCCCTG 'TTCACACTAC 360 



CCTGTCAAAG TCAGATGCCA AAAAAGCCGC CTCAAAGACG CTGCTGGAGA AGAGTCAGTT 420 

TTCAGATAAG CCGGTGCAAG ACCGGGGTTT GGTGGTGACG GACCTCAAAG CTGAGAGTGT '. 430 


GGTTCTTGAG CATCGCAGCT ACTGCTCGGC AAAGGCCCGG GACAGACACT TTGCTGOGGA 540 



TGTACTGGGC TATGTCACTC CATGGAACAG CCATGGCTAC GATGTCACCA AGGTCTTTGG 600 - 



55 GAGCAAGTTC ACACAGATCT CACCCGTCTG GCTGCAGCTG AAGAGACGTG GCCGTGAGAT 660 



GTTTGAGGTC ACGGGCCTCC ACGACGTGGA CCAAGGGTGG ATGCGAGCTG TCAGGAAGCA 720 



TGCCAAGGGC CTGCACATAG TGCCTCGGCT CCTGTTTGAG GACTGGACTT ACGATGATTT 7-80 

CCGGAACGTC TTAGACAGTG AGGATGAGAT AGAGGAGCTG AGCAAGACCG TGGTCCAGGT 840

GGCAAAGAAC CAGCATTTCG ATGGCTTCGT GGTGGAGGTC TGGAACCAGC TGCTAAGCCA. 9Q0 

5 GAAGCGCGTG ACCGACCAGC TGGGCATGTT CACC-CAC-AG GAGTTTGAGC AGCTGGCCCC 960 

CGTGCTGGAT GGTTTCAGCC TCATGACCTA CGACTACTCT ACAGCGCATC AGCCTGGCCC 1020 
TAATGGACCC CTGTCCTGGG TTCGAGCCTG CGTCCAGGTC CTGGACCCGA AGTCCAAGTG " 1Q80 


GCGAAGCAAA ATCCTCCTGG GGCTGAACTT CTATGGTATG GACTACGCGA CCTCCAAGGA 1140 

TGCGCGTGAG CCTGTTGTCG GGGCCAGGTA CATCGAGACA CTGAAGGACC ACAGGGCCCG 1200 

15 GATGGTGTGG GAGAGCCAGG YCTCAGAGGA CTTCTTCGAG. TACAAGAAGA GGGGCAGTGG 1260 

GAGGCACGTC GTCTTCTACC CAACCCTGAA GTCCCTGCAG GTGCGGGTGG AGCTGGCCCG 1320 

GGAGCTGGGC GTTGGGGTCT CTATCTGGGA GCTGGGCCAG GGCCTGGACT AC TTGTACGA 1330' 


CCTGCTCTAG GTGGGCATTG CGGCCTCCGC GGTGGACGTG TTCTTTTCTA AGCCATGGAG 1440 

TGAGTGAGCA GGTGTGAAAT ACAGGCCTTC ACTCCGTTAA AAAAAAAAAA \AAAAAAAAAA 1500 

aaaatAAAAA aaaaa t 1515



(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 704 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

40 AAGATGGTGG CGCCCAGAGC TTCGCTCTAT GCTGCTCCCC TGAGAGAGGC GTTTCCATCA 60 

ACCAGTTTTG CAAGGAGTTC AATGAGAGGA CAAAGGACAT CAAGGAAGGC ATTCCTCTGC 120 

CTACCAAGAT TTTAGTGAAG CCTGACAGGA CATTTGAAAT TAAGATTGGA CAGCCCACTG 130 


TTTCCTACTT CCTGAAGGCA GCAGCTGGGA TTGAAAAGGG GGCCCGGCAA ACAGGGAAAG 240 
AGGTGGCAGG CCTGGTGACC TTGAAGCATG TGTATGAGAT TGCCCGCATC AAAGCTCAGG ; 300 

50 ATGAGGCATT TGCCCTGCAG GATGTACCCC TGTCGTCTGT TGTCCGCTCC. ATCATC'GGGT 360' 

CTGCCCGTTC TCTGGGCATT CGCGTGGTGA AGGACCTCAG TTCAGAAGAG CTTGCAGCTT 420 

TCCAGAAGGA ACGAGCCATC TTCCTGGCTG CTCAGAAGGA GGCAGATTTG GCTGCCCAAG 430 


AAGAAGCTGC CAAGPAGTGA CCCTTGCCCC ACCAACTCCC AGATTTCAAA GGAGGTAGTT 540 

GCAAAAGCTG TGCCCAAGGG GAGGAAGGAG GTCACACCArs. TATGATGATO GTTTTCATGA 60 Q. 

CTTTGAATGA TATATTTTTG TACATCTAGC TGTATCGAGG CATCAGGCCT GAATAAACAT 660

CCTTTCTTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 704

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1094 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

GCCAC-CTTTC TTACAAACCC ATCCTTCTGA AATGTTGCTT C^AATTC^TC CTCTGCTCCC 60 

CAGTCCCACT ATTCCACACA TACTGTTACT GTTTCTTTAT CCTACTTTCT CAATTTTGGA 120 


ACATAGTTGC AGTTTACTGCA TTGAATACCT GTGGGTTTGC CTGTTGTTCT GTCTGTCTCT 130 

GTGGTTCTTG TAATANTGGA TCCCAGAGAT AAAATGGACA GTTGTNATGC ACAGTTAATT 240 

25 CAGAAACTAG ' ACCTTACTTG CTGTGTGAAA TACCAACTAA ATTCTCAGTG AACTCAGCTG 300 

ANCTTTATCT CCTTTTGTTT CCCC^ATTTA TAATTTCAGT TCAGGCCCAG AAAGATGGAA 360 

TCCCAGCTAA GAAATACAAG TTACACCCTG T ACTAGCAGC ' CCATGTGTGC ATGTTCTTTA ' 420 


AGTGCTCTTG CAGCTATGTC ATTTATATTG ATTTCCCTGT ATTATTATAA GCAAAGCAAA 430 

TTTGAGGAA-A AAAACCCATA ATACCACACC TCATTTTTTT CAAGTAATAG GGTCATAAGT 540 

35 CTCATYCTYC ATATAATATG TTGAGTATGC AGTATATTAT GTGTTAGGCT CTGGANAGGC 600 

AGAGGTTAGA TCATGTWACA GATQATATCK GATTAGGCAG ATAAACAGTA TTTTAACCTT 650 

TTCCTTATTA TATGTAACTT GCTTTCAGGT TTTTTAATGT TACTATTATG TCTTTAATAT 720 


ATTATCTTTA TTTGTACTTT TGTATACAGA GTGATTTTCC TTTTTTAAAA AAAATTGTGT 730 

CTTTAGGATG GATTCCAAAG ATGTGGAATC AGTAGGTTTA AGGAATATGG ATATTTTGGC 340 

45 TGGCAAGGTG GCTCACACCT GTAATCCCAG CACTTTGGGA GOCTGAGOVZ- GGTGGATCAC 900 

CTGAAGTCAG GAGTTCGAGA CC\GCCTGAC CAACATGGCG AAACCCTGTT TNTACTAAAG 960 
ACACACaMAA AATTRGCCAG TGGTGGTGGC ATGTGCTTGT AGTCCCACTT AGCTACTCGA ', 1020 


GAGGCTGAGG CAGGAGAATC GCTTGAACCC GGGAGGCAGA GGTTGCAGTG AGGCAAGATG 1080 

GCACCTCTAC ACTC 1094 

(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1821 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

tgc<tttaggc c-.rrL-.czrr: cccttggctg gaactactgg acagaccctt ttgagatgtg 

CCTG7GGTGC TG^GGAGATG TGTGTAGTGG TCTTAGCTCT TTGTTGAGCT TGTGTGTGTG 
TTGCGTAGCC TCAGCTGTAG GCTCAAATTG GGCGTGTGTT GGAGGGCTTC TTAGCTCTTT 
GGTCAGA.T7G C-.TTTCTATG 7GTTTGTA.TC ASCTGAATGT TGCTGGAAAT AAAACCTTGG 
TTTGC^AGG CTCT-TCTTG TGGGAAGTAA. GTAGGGGAAA AGGTCTTTGA GGGTTCCTAG 
GCTCCTTCGT AJC-ACAjGGAA AATGZCTC-A AGCCTTGCTT CCCAGCAACC TGGGGCTGGT 
TCCCAGTGCC TGCGCCiGCC CCTTCC7GGT TCTTATCTCA AGGCAGAGCT TCTGAATTTC 
AGC-GGTTCAT TCC^GAGCCC TCTTGTGGC C AGGCCTTCCT TTGCTGGAGG AAGGTACACA 
C-GGGG.-A.Gn: CAGGCTGTA.C TTGGGGGATC TCCTTGGCCT GTTCCACCAA GTGAGAGAAG 
GTA.CTTACGC TTGCACCTCC TGGTCAGCCA GGTGCATTAA CAGACCTCCC TACAGCTGTA 
GGAACTACTG CCCCAGAGCT GAj3GCAA.GGG GATTTCTCAjG GTCATTTGGA GAACAAGTGC 
TTTAGTAGCA. GGCCAAAGTA GTAAC7GCTA CTGTATTTAG J TGGGGTGGAA. TTCA.GAA.GAA. 
ATTCC-AGAC C-JC-CCATC-G GTGGTCTC-CA. TGTGAATGAA CAGGAAjTGAG CCGGACAGCC 
TGGGTGTO.T TGCCTTCTTC CTCCCC-.TTT GGACCCTTCT CTGCCCTTAC ATTTTTGTTT 
CTCCATCGAC CAC CATCCAC CAGTCTATTT ATTAA.CTTAG CAAGAGGACA AGTAAAGGGC 
CCTCTTGG-T TGA7TTTGCT TCTTTCTTTC TGTGGAGGAT . ATACTAAGTG CGACTTTGCC 
CTA-CCCTATT T3CAAATCCC TAA-C-.GAA.TT GAGTTTTCTA TTAAGGATCC AAA.AAGAAAA. 
ACA.-AATGCT AATCAAGCC 1 . TCAGTCAAGG GTCACATGCC AATAAACAAT AAA.TTTTCCA 
GAAjGAAACCA AATCCAACTA. GACAAA.TAAA GTAGAGCTTA_ TGAAATGGTT CAGTAAGGAT 
GAGGTTGGTG TtTGTTGTTT TGTTTTGTTT TGKTTTTTTA AAGACGGAGT CTCGCTCTGT 
CACTCAGGGT GGAGTGCAGG GGTATGATCT TGGCTCACTG TAACCTCCGC CTCCCGGGTT 
CAAGCCACTC TCCTGCCTCA GTCTCCTGAG TAGCTGGGAT TACAGGTGCG TGCCACCATG 
CCTGGCTAAT TTTTGTGTTT TTAGTAGAGA CAGGGTTTCA OCATGTTGGT CGGGCTGGTC 
TCAAACTC'GT GACCTCTTGA TCCGCCTGCC TTGGCCTCCC , AAAGTGA.TGG GATTACAGAT 
GTGAGCC-.CC CGCGCCCTAG CC^AGGATGA.' GA.TTTTTAAA GTATGTTTCA GTTCTGTGTC 
ATGGTTGGAA GAjCAGAGTAG GAAGGATATG GAAAAGGTCA TGGGGAAjGCA GAGGTGATTC 
ATGGCTCTGT GAA.7TTGAGG TGAATGGTTC CTTATTGTCT AGGCCACTTG TGAAGAATAT
GAGTCAGTTA TTGCCAGCCT TGGAATTTAC TTCTCTAGCT TACAATGGAC C T T T TG ftACT
GGAAAACACC TTGTCTGCAT TCACTTTAAA ATGTCAAAAjC TAATTTTTAT AATAAATGTT 
TATTTTOIZA TTGAAAAAAA AAAAAAATTT AAAAACYCGG GGGGGGCCC5 GWACCCCATT 
NGCCCCTAAG GGGGGGGGTT T 

(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1024 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

GGGGCACAGT TGAAGAAGCG ACCGAGGGAC TGGGAGTCGT TAGTGAGGAT GACGCGGCAT 

GGCAAGAACT GC^.CCGCXGG GCCGTCTACA CCTACCACGA GAAGAAGAAC" 'GACACAGCGG 

CCTCGGGCTA TGGGACCCAG AACATTCGAC TGA'GCCGGGA TGCCGTGAAG GACTTCGACT 

GCTGTTGTCT CTCCCTGCAG CGTTGCCACG ATCCTGTTGT CACCCCAGAT GGCTACCTGT 

ATGAGCGTGA GGCCATCCTG GAGTACATTC TGCACCAGAA GAAGGAGATT GCCCGGCAGA 

TGAAGGCCTA CGAGAAGCAG CGGGGCACCC GGCGCG&CGA GCAGAAC-GAG CTTCAGCGGG 

CGGCCTCGCA GGACCATGTG CGGGGCTTCC TGGAGAAGGA GTCGGCTATC GTGAGCCGGC 

CCCTCAACCC TTTCACAC-CC AAGGCCCTCT CGGGCACCAG CCCAGATGAT GTCCAACCTG 

GGCCCAGTGT GGGTCCTCCA AGTAAGGACA AGGACAAAGT GCTGCCCAGC TTCTGGATCC 

CGTCGCTGAC GCCCGAAGCC AAGGCCACCA AGCTGGAGAA GCCGTCCCGC AC GGTGACCT 

GCCCCATGTC AGGGAAGCCC CTGCGCATGT CGGACCTGAC GCCCGTGCAC TTCACACCGC 

TAGACAGCTC CGTGGACCGC GTGGGGCTCA TCACCCGCAG CGAGC'GCTAC GTGTGTGCCG 

TGACCCGCGA CAGCCTGAGC AACGCCACCC CCTGCGCTGT GCTGCGGCCC TCTGGGGCTG 

TGGTCACCCT CGAATGCGTG GAGAAGCTGA TTCGGAAGGA CATGGTGGAC CCTGTGACTG 

GACA.CAAACT CACAGACCGC GACATCATCG TGCTGCAGCG GGGCGGTACC GSTTCGCGGG 

CTCCGGAGTG AAGCTGCAAG CGGAGAAATC AGGGCCGGTG ATGCAGGCCT GAGTGTGTGC 

GGGAGACCAA ATAAACCGGC TTGGGTGCGC AAJ\AAAAAAA AAAAAAAAAA AAAAAAAAAA 

AAAA

(2) INFORMATION FOR SEQ ID NO: 45:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 983 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:
CGACA.CGGCT GCGAGAAGAC GACAGAAGGG CCCGACCGCG AGCCGTCCAG GTCTCAGTGC 
TGTGCCCCCC CCAGAGCCTA GAGGATGTTT CATGGGATCC CAGCCACGCC GGGCATAGGA 
GCCCCTGGGA "ACAAGCCGGA GCTGTATGAG GAAGTGAAGT TGTACAAGAA CGCCCGGGAG 
AjGGGAGAAGT ACGACAACAT GGCAGAGCTG TTTGCGGTGG TGAAGACAAT GCAAGCCCTG 
GAGAA.GGCCT ACATCAAGGA CTGTGTCTCC CCCAGCGAGT ACACTGCAGC CTGCTCCCGG 
CTCCTGGTCC AATACAAAGC TGCCTTCAGG CAGGTCCAGG GCTCAGAAAT CAGCTCTATT 
GACGAA.TTCT GCCGCAAGTT CCGCCTGGAC TGCCCGCTGG CCATGGAGCG GATCAAGGAG 
GACCGGCCCA TCACCATCAA GGACGACAAG GGCAACCTCA ACCGCTGCAT CGCAGACGTG 
GTCTCGCTCT TCATCACGGT CATGGAOuAG CTGCGCCTGG AGATCCGCGC CATGGATGAG 
ATCCAGCCCG AC CTGCGAGA GCTGATGGAG ACCATGCACC GCATGAGCCA CCTCCCACCC 
GACTTTGAGG GCCGCCAGAC GGTCAGCCA.G TGGCTGCAGA CCCTGAGCGG CATGTCGGCG 
TCAGATGAGC TGGACGACTC ACAGGTGCGT CAGATGCTGT TCGACCTGGA GTCAGCCTAC 
AACGCCTTCA ACCGCTTCCT GCATGC CTGA GCCCGGGGCA CTAGCCCTTG CACAGAAGGG 
CAGAGTCTGA GGCGATGGCT CCTGGTCCCC TGTCCGCCAC AC AGGC CGTG GTCATCCACA 
CAACTCACTG TCTGCAGCTG CCTGTCTGGT GTCTGTCTTT GGTGTCAGAA CTTTTGGGCC 
GGGCCCCTCC CCA.CAATAAA GATGCTCTCC GACCTTCAAA AAAAAAAAAA AAAAAAAAGR 
KGSGGCCGGT CCCCAOTCCC CCC 

(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2421 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

CCGGCTGATC GCTGCOGCTC CGCCAATACA ATAGAGCCAK CCACTACCAG CAGCCTGGCC
CTCTTCCTCC TTCTCCAGAG AGACCAATGC AGCCGAACTC GGOGTTTGCC TGAGGAGAAG 120

GAGGAAGTGA CCATGGACAC AAGTGAAAAC AGACCTGAAA ATGATGTTCC AGAACCTCCG 1.30 

ATGCGTATTG GAGACCAAGT CA.GCAATGAT GACCGCCCGG AGGGCAGTGT TGAAGATGAG 240 

GAGAAGAAAG AGAGCTCGCT GCCCAAATCA TTCAAGAGGA AGATGTCCGT TGTCTCAGCT 300 

ACCAAGGGGG TGCCAGCTGG AAACAGTGAC ACAGAGGGGG GCGAGCGTGG TCGGAAACGA .360 

CGCTGGGGAG CCAGCACAGC CACGACACAG AAGAAACGTT CCATCAGTAT CACCACTGAA 420 

TCACTAAAGA GCCTCATCCC GGACATCAAA CGCCTGGCGG GGCAGGAGGC TGTTGTGGAT 430 

15 . CTTCATGCTG ATGACTCTCG CATCTCTGAG GATGAGAGAG AGCGTAATGG GGATGATGGG 540 

AGCCATGACA AGGGGCTGAA AATATGCCGG AGAGTGAGTC AGGTAGTACC TGCAGAGGGC 600 

CAGGAGAATG GGCAGAGGGA AGAAGAGGAA GAAGAGAAGG AAGCTGAAGC AG AACCTC CT 660 


GTACCTCCCC AGGTGTCAGT AGAGGTGGCC TTGCCCCCAC CTGCAGAGCA TGAAGTAAAG 720 

AAAGTGACTT TAGGAGATAC CTTAACTCGA CGTTGCATTA GCCAGCAGAA GTCCGGAGTT 730 

25 TCCATTACCA TTGATGACCC AGTCGGAACT ' GZ CCAGGTGC CCTCCCQJkCC CCGGGGCAAG • 340 

ATTAGCAACA TTGTCCATAT CTGCAATTTG GTCGGTCCTT TCACTTTAGG CCA.GCTAAAG SGC 

GAGTTGTTGG GGCGCACAGG AACCTTGGTG GAAGAGGCCT TCTGGATTGA CAAGATCAAA ' 960 


TCTCATTGCT TTGTAACGTA CTCAACAGTA GAGGAAGCTG TTGCCACCCG CACAGCTCTG 1020 

CACGGGGTCA AATGGCCCCA GTCGAATCCC AAATTC'GTTT GTGCTGACTA TGCCGAGCAA 1 1080 

35 GATGAGCTGG ATTATCACCG AGGCCTCTTG GTGGACCGTC CCTCTGAAAC TAAGACAGAG 1140 

GAGCAGGGAA TACCACGGCC CCTGCACCCC COvCCCCCAC CCCCGGTCCA GCCACCACAG 1200 

CACCCCCGGG CAGAGCAGCG GGAGCAGGAA CGGGCAGTGC GGGAAGAGTG GGC&GAACGG 1260 


GAACGGGAAA TGGAGCGGCG GGAGCGGACT CGATCAGAGC GTGAATGGGA TCGGGACAAA ' 13 20 

GTTCGAGAAG GGCCGCGTTC CCGATGAAGG TCCCGTRACC GCCGCCGCAA GGAACGTGCG 1330 

45 AAGTCTAAAG AAAAGAAGAG TGAGAAGAAA GAGAAAGCCC AGGAGGAAGC ACCTGCCAAG 1440 

CTGCTGGATG ACCTTTTCCG ' AAAGACCAAG GCAGCTCGGT GCATCTATTG GCTCCCACTG 1500 

ACTGACAGGG AGATGGTTCA GAAAGAGGCA GAGGGGGGCG AACGGCOCAA GGAGCGGGAG -.1560 " 


AAGCGGCGAA AGGAGCAAGA AGAAGAAGAG C\AAAGGAGG GGGAGAAGGA AGGGGAGCG-G 1620 

GAACGGAACG GACAGCTGGA GCGAGAGAAA CGTCGGGAGC ACAGTCGGGA GAGGGACAGG ^ 1630 

5 5 GAGAGAGAGA GAGAAAGGGA . GCGGGACAGG GGGGACGGAG ATCGGGATAG GGAAAGGGAC 1740 

CGAGAACGAG GGAGGGAAAG GGATCGCAGG GACACGAAGC GCCACAGCAG AAGCCGGAGT 1300 

CGGAGCACAC CTGTGCGGGA CCGGGGTGGG CGCCGCTAGC TGGGAAAACA CTAGAGCTGC 1860
AGGTACCAGC CACTCGGCCC CAGGGGGTTA TGGCCACAGA GGGATAGGCA CAGTCTCCAC
CACCCTGGAG CCAAGGGTCT TTCACATCAC CTATCCCTAC ATACATACCA AATGGAAAAG 
TGGCGATCCT TTTCCCCCCA AACACACCCC CTTAACCTAT CTCTTGGGAC TTAGCCCGAC 
CCTCCCTCTC ATTTCCCATT AAGTCTGAGA GGCAAGAGCT AGGTTAGGCA AGGAGGTGGT 
TGGCCAGAGA TGGGGAACAG CCAGGTGCCC CAGTCCTCTG ATTTTTCCTC CATCCTGCTT 
ACCACCTCCC TGGGTACTTA CAGCCTTCTC TTGGGAACAG CCGGGGCCAG GACTGGGTCA 
CCTATGAGCT GAATCAGCAT CTCCTCCTGA GTCCCAGGGC CCCTGCAGTT CGCAGTCTCT 
TCTGTCCTGO AGCCCTTGCC TCTTTCCCAC AGGTTCCACT TTATATCCAG CTTTTCCTTT 
TGTTCAATTT TTATTTTTAT TTTTTTTATT ATTAAATGAT GTGGTCTATG GAAAAAAAAA 
TAAAAATCTG ACTTA.GTTTT A 

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 840 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:
CTCAAACTCC TGAGCTGAAG CGATCTACCT GCCTCAGCTA GGATTACAGG TGTGAGCCAC 
CGCACCCAAC CTCAATAAGC KTATTTGATA AAAKATATGC AAGCTCCCTT TATKCACTTT 
TCATTCAGAA TGTTTAGTAA TTTGTATTGT TTTTCAGATT TTCAGC CCAA TATATCTCCi 
TGCCCACTGT GTCACTGTAT TCTACCTAWA CATCATCACG TGTTTCTGCT ATTGGCTGTA 
TGATGGAACA CTGCGGCTCA TTTTC CTGAA AACTGCCGAT AGTGCATAGA RTGCTGGGAT 
GGAAACCAGA ARCTTTGAAT TCAAGCCTTG GTTCTGCCTT GTTTTTGCTT GGGTGGCCTT 
GAGTCAGCCA. CATACCTTTT AAAATCTCAA TTTATTAGAA ATTATTCCAA ATCAAAATCA* 
AATGAGAAGG TATATACAAA AGTGCTTTAT CCCACAATAA ACTATTCAAG AGAGAGCA.-A 
GGAGAGGACA TTTACTCAAC ACCTCCTAAA AGGCAGCCAG TGAAATTAGG CATTTTATTT 
AATCCTCCTG GCAACTCTGA GAGTAAAGCA TTATTAATCC CATTTTGGCT GTTTAAAGAA 
ATTATTTGCA CTAGATTCCA GCTGTAGTTT AGYTTCAGAA AAAAAAATCC TGAGATGTGA 
ATTCACAGCT TTCTGGGTTT AAAGCCCAAG CTCTATCACA TCATGCTATT ATTGTTACAT 
TACTGCTAGT TCTATGAAAA • GAAATACTAA TTTATGAAAT ACATCTTATC CAAAAAAAAA 
AAAAAAAAAC TGGGAGGGGG GGCCCGTACC CAAATCGCCG GATAGTGATC GTAAACAACC

(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2432 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

15 GGCACGAGGC CCGGAACGCT GAGGAAGGGC CCGTCCCGCC TTCCCCGGCG CGCCATGGAG 60 

CCCCGGGCGG TTGCAGAAGC CGTGGAGACG GGTGAGGAGG ATGTGATTAT GGAAGCTCTG 120 

CGGTCATACA ACCAGGAGCA CTCCCAGAGC TTCACGTTTG ATGATGCCCA ACAGGAGGAC 130 


CGGAAGAGAC TGGCGGASTG CTGGTCTCCG ' TCCTGGAACA GGGCTTGCCA CCCTCCCACC 240 

GTGTCATCTG GCTGCAGA.GT GTCCGAATCC TGTCCCGGGA. CCGCAACTGC CTGGACCCGT 3Q0 
25 TCACCAGCCG CCAGAGCCTG CAGGCAYTAG CCTGVTATGY TGACATCTCT GTCTCTGAGG " . 360 

GGTCCGTCCC AGAGTCCGCA GAC ATGGATG TTGTACTGGA GTCCCTCAAG TGCCTGTGCA 420 

ACCTCGTGCT CAGCAGCCCT GTGGCACAGA TGCTGGCAGC AGAGGCCCGC CTAGTGGTGA 430 


AGCTCACAGA GCGTGTGGGG CTGTACCGTG ' AGAGGAGCTT CCCCCACGAT GTCCAGTTCT 540 

TTGACTTGCG GCTCCTCTTC CTGCTAACGG CACTCCGCAC CGATGTGCGC CANAGCTGTT 600 

35 TCAGGAGCTG AAAGGAGTGC GCCTGCTAAC TGACACACTG GAGCTGACGC TGGGGGTGAC 560 

. TCCTGAAGGG AACCCCCCAC CCACGCTCCT TCCTTCCCAA GAGACTGAGC GGGCCATGGA 720 

GATCCTCAAA GTGCTCTTCA ACATCACCCT GGACTCCATC AAGGGGGAGG TGGACGAGGA 780 


AGACGCTGCC CTTTACCGAC ACCTGGGGAC CCTTCTCCGG CACTGTGTGA TGATCGCTAC 340 

TGCTGGAGAC CGCACAGAGG AGTTCCACGG CCACGCAGTA ASCCTCCTGG. GGAACTTGCC 900 

45 CCTCAAGTGT CTGGATGTTC TCCTCACCCT GGAGCCACAT GGAGACTCCA CGGAGTTCAT 960 

GGGAGTGAAT ATGGATGTGA TTCGTGCCCT CCTCATCTTC CTAGAGAAGC GTTTGCACAA 1020 

GACA.CACAGG CTGAAGGAGA GTGTAGCTCC CGTGCTGAGC GTGCTGACTG AATGTGCCCG 1080 


GATGCACCGC CCAGCCAGGA AGTTCGTGAA GGCCCAGGTG CTGCCCCCTC TGCGGGATGT 1140 
GAGGACACGG CCTGAGGTTG GGGAGA.TGCT GCGGAACAAG CTTGTCCGCC TCATGACACA ' 1200 

55 C CTGGACACA GATGTGAAGA GGGTGGCTGG' CGAGTTCTTG TTTGTCCTGT GCTCTGAGAG 1250 

TGTGCCCCGA TTCATCAAGT ACACAGGCTA TGGGAATGCT GCTGGCCTTC TGGCTGCCAG 1320 

GGGCCTCATG GCAGGAGGGG GCCCGAGGGC AGTACTCAGA. GGATGAGGAC ACAGACACAG 13 8 0" 

ATGAGTACAA GGAAGCCAAA GCCAGCATAA ACCCTGTGAC CGGGAGGGTG GAGGAGAAGC 1440

CGCCTAACCC TATGGAGGGC ATGACAGAGG AGCAGAAGGA GCACGAGGCC ATGAAGCTGG 1500 

5 TGACCATGTT TGACAAGCTC. TCCAGGAACA GAGTCATCCA GCCAATGGGG ATGAGTCCCC 1560 

GGGGTCATCT TACGTCCCTG CAGGATGCCA TGTGCGAGAC TA.TGGAGCAG CAGCTCTCCT 1S20 

CGGACCGTGA CTCGGACCCT GACTGAGGAT GGCAGCTCTT CTGCTCCCCC ATCAGGACTG 1630 


GTGCTGCTTC CAGAGACTTC CTTGGGGTTG CAACCTGGGG AAGCCACATC CCACTGGATC 1740 
CAGACCCGCC CCCACTTCTC GATCTTAGAA ACCCCTTCTC TTGACTCCCG TTCTGTTCAT ' 13 00 

15 GATTTGCCTC TGGTCCAGTT TCTCATCTCT GGACTGCAAC GGTCTTCTTG TGCTAGAACT 1360 

CAGGCTGAGC CTCGAATTCC ACAGACGAAG TACTTTCTTT TGTCTGCGCC AAGAGGAATG 1920 

TGTTCAGAAG CTGCTGCCTG AGGGGAGGGC CTACCTGGGC - AC ACAGAAGA GCATATGGGA 1980 


■ GGGCAGGGGT TTGGGTGTGG GTGCACACAA AGCAAGCACC ATCTGGGATT GGCACACTGG 2040 

CAGAGCMANT GTKTTGGGGT ATGTGCTGCA CTTCCCAGGG AGAAAACCTG TCAGAACTTT 2100 

25 CCATACGAGT ATATCAGAAC ACACCCTTCC AAGGTATGTA TGCTCTGTTG TTCCTGTCCT 2150 

GTCTTCACTG AGGGGAGGGC TGGAGGCCTC TTAGACATTC TCCTTGGTCC TCGTTCAGCT 2220 

GCCCACTGTA GTATCCACAG TGCCCGAGTT CTCGCTGGTT TTGGCAATTA AACCTCCTTC 2230 


CTACTGGTTT AGACTACACT TACAACAAGG AAAATGCCCC TCGTGTGACC ATAGATTGAG 2340 

ATTTATACCA CATACCACAC ATAGCCACAG AAA.CATCATC TTGAAA.TAAA GAAGAGTTTT 2400 

35 GGACAAAAAA AAAAAAAAAA AAAAAAAAAA AA ^ 2432 

(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1742 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

50 GTCCTGCAGG AGCTGCACGC GGCCGAGGTG CGCANGAACA AGGAGCAGCG AGAAGAGATG . 60 

TCGGGCTAAG GGCCCGGSAC GSGSGGCGCC CATCGTGCGA CGGAACACGT TCGGGTTTTG 120 

GTTTTGTTTC GTTCACCTCT GTCTAGATGG AACTTTTGTT CCTCCTCCCC CACCCCAGCC 130 


CCCAGCTTCA TGCTTCTCTT CCGCACTCAG CCGCCCTGCC CTGTCCTCGT GGTGAGTCGC 240 

TGACCACGGC TTCCCCTGCA GGAGCCGCCG GGCGTGRAGA CGCGGTCCCT CGGTGCAGAC 300 

ACCAGGCCGG GCGCGGCTGG GTCCCCCGGG GGCCCTGTGA GAGAGGTGGY GGTGACCGTG 360

GTAAACCCAG GGCGGTGGCG TGGGATCRCG GGTCCTTACG CTGGGCTGTC TGGTC\GCAC 420

GTGCAGGTCA GGGCAGGTCC TCTQAGCCGG CGCCCCTGGC O.GCAGGCGA GGCTACAGTA . 430 


CCTGCTGTCT TTCCAGGGGG AAGGGGCTCC CCATGAGGKA GGGGCGACGG GGGAGC-GGGG 540 

TGATGGTGCC TGGGAAGCCT GCKTGTGCAN CCGGTGCTTG TTGAACTGGC AGGCGGGTGG 600 

10 GTGGGGGCTG C^GCTTTCCT TAATGTGGTT GCACAGGGGT CCTCTEAGAC CACCTGGCGT 660 

GAGGTGGACA CCCTGGGCCT TCCTGGAAGC CTGCAGTTGG GGGCCTGCCC TGAGTCTGCT 720 

. GGGG^j3TGGG CATTCTCTGC CAGGGACCCA TGAGCAGGCT GCATGGTCTA GAGGTTGTGG ' 780 


GCAGCATGGA CAGTCCCCCA CTCAGAAGTG CAAGAGTTCC AAAGAGCCTC TGGCCCAGGC 340 

CCGTCCGTGG GACAGCCCCG CCGCCCCTCC CCACCAGGGC TTTGCAGATG TCCTTGAAAG 900 

20 ACCCACCCTA GAGCCCTTTG GAGTGCTGGC CCCTCCTGTG CCCTCTGCCC TGGTGGAAGC 960 

GGCA3CAOA GTCCTCCTCA GGGAGCCCCA AGGGGGATTT TKTGGGACCG CTGCCCACAG 1020 

ATCCAGGTGT TGGAAGGGCA GCGGGTAAGG TTCCCAAGCC AGCCCCAACA CCCTTCCCAC 1080 


-' TTGGCACCCA GAGGGGGCTG TGGGTGGAGG CCTGACTCCA GGCCTCTCCT GCCCACACCC I 140 

' TCTGGGCTGA GTTCCTTCTT TCCCTTGGAC GGCCAGTGGT GGCCTTGGAG GACGGTCAGC 1200 

30 TGGAGGATGG CGGTGGGGGA GGCTGTCTTT GTACCACTGC AGCATCCCCC ACTTCTCCAC 1260 

GGAAGCCCCA TCCCAAAGCT GCTGCCTGGC CCCTTGCTGT AAAGTG TG AA GGGGGC-OGCT 1320 

GAGTTCTCTT AGGACCCAGA GCCAGGGCCC TCAACTTCCA TCCTGCGGGA GGCCTTGGCC 1330 


GGGCACTGCC AGTGTCTTCC AGAGCCACAC CCAGGGACCA CGGGAGGATC CTGACCCCTG 1440 

CAGGGCTCAG GGGTCAGCAG GGACCCACTG CCCCATCTCC CTCTCCCCAC OAGACAGCC ISOO 

40. CCAGAAGGAG CA3CCAGCTG GGATGGGAAC CCAAGGCTGT CCACATCTGG CTTTTGTGGG 1560 

ACTCAGAAAG GGAAGCAGAA CTGAGGGGTG GGATATTCCT CATGGTGGCA GCGCTCATAG 1620 

CGAAAGCCTA CTGTAATATG CACCCATCTC ATCCACGTAG TAAA3TGAAC TTAAAAATTC 1530 


AATCAAATGA ACAATTAAAT AAACACCTGT GTGTTTAAGA A^AAAAAAAA AAAAAAACTG 1740 

CG 1742 
(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1487 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

GGCACGAGCC TCCGCGAACT GTGGAGTCGG CGGAGGGCTG GAATCAGCGT GGGCTCCAGG 50 

TCGCTGGCAG C CGGGTGGCA GAACTCTTCC GAGGCTCCTT GGGAAGAAGC TAG AC COG AG 120 

GGAGCCGGAT GGGCCTCGAA AACCTGGCCC GCTCTGGTTC TGT AC CATTG CAAGGGGAAC ISO 

CGTAAACTGA GCTTTTCTAA CGTGGGTTTC TGCCAAGTAC TTTTCCAGCT GCCCCCTTCC " 240 

CCCCAGCACA CAGGAGAGCC TCTGTGTAGC CAGCGCTTGA O.GTCGTTAG GTAGGTTGTA 300 

CTGTGTAGGG AGGAGCTCAA GATCATGAAT GGTTGTGAGA GGAGAAAGCG GTTGCATCTT 360 

15 TGCAAAACTA TATACCTGCT GTGGTTTGTG TTTTCTTTTC TGCTGAGTAA TGAAGTTGTA 420 

AGTTCACACT GGCACATTCT CAGGGCTGTG CAGATTATTT GCACTTTA.TT TCATAGGTGR 430 

ATAAGTGCTT 'TTTAGCTTTC TTTGTATATT GAGTTGCTTT TGAATTGCTT CCCATATTTT 540 


TATTTCATAC AAACTGAACA ATTGTGGCCC CTCTATTTTA TTTATAAAGG TTCAGTGTAT 600 

CTTTGCCTGC CTACATCAAT CTGCAAGGGA GTTGCAGAAA GCCTCATGTT CATCGAGCCG 660 

25 TGAGTCACAA CCAATTTCTA AGCTGTTATA ACAAAAAAGT GTTTGCTTTT TTTCACAAGT 720 

AACTTTAAAA GTGTAGTTTA GAAAGAAAAC ATTTTCAATA AAAAGACACT ACATTAATCC 730 

TGGATGCTTG CAAATCCTAA AATKTATTCC TCCTCTAGCG TTGCAGAGCT CTGTGTTGTA 340 


TACACAGACT AGCTTTAAAA TTTGTCACAT ACCACTTTAC CTTTACTTTT ATGTATCATT 900 

CCCCCGACTT CCTTACTGCA GCTGTGGGCA AGAAAACTTT TCCTTTAACA CTTTTCAACA 960 

35 GCGGGCATAA AATTCTGCAG CTGAGGTCTT GAAGAATGCA GATGGGTACA GTATGTGTTG 1020 

GAGCTCACAG TGTGTATTGA CTAACCTAGT TCCTTTTTTG CTTTTTTTGG TATTGTCTTG '1,03-0 

TTAAAAGTGA CTCCCAGGTA GCAACTCTCT TTTTTAAGGG TGGGAACGAA AGGGACGTAG 1140 


GAAGAATA.GA TCTAGATTAT TTAACAGTCT TCGATAGAGT TTGAAAGCTT TCTTCTTCAT 1200 

TCAATTTTGG GCAAAATACT GCCTCTGCAT TTGTTCATAA C\AAAAGATT AGATTAATAA 1260 

45 GTAGCTTTTG TTGGTGGAAA . TTACCAGCTC TATAAGTCAC CCTTGGTGGT TCATGGACCT 1320 

CTGATTAGCT TGGGTTTTGC AGTCTCATTG CCACATGTAT ATGTGGAGCC AATGGCCTTT 1380 

TGGTGCTCAG CTGTTTACGT CTGACTCCTT GACTTCTTTG GTACAGTGAT GGAGTCAGAT -' 1440 

CTCATTAAGT GTGATTCTCC ATGGATATAA CCAGCCCCAA AAAAAMG 1487



(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1328 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:


GGCACGAGCT CGTGCCGAAT TCGGCACGAG AGAAGATTTG AAGAAGCCAG ATCCAGCTTC' 60. 

CCTGCGGGCT GCTTCTTGTG GGGAAGGGAA AAAGAGGAAG GCCTGTAAGA ACTGCACCTG 12 Q 

10 TGGCCTTGCC GAAGAACTGG AAAAAGAGAA GTCAAGGGAA CAGATGAGCT CCCAACCCAA 130 

GTCAGCTTGT GGAAACTGCT ACCTGGGCGA TGCCTTCCGC TGTGCCAGCT GCCCCTACCT 240 

TGGGATGCCA GCCTTCAAAC CTGGGGAAAA GGTGCTTCTG AGTGATAGCA ATCTTCATGA 3QQ 

TGCCTAGGAG GTTCCTGACA TGGGACCCAT CTGCTCCTCC AGCCAACTCC TGTCCCTCAC 360 

ATCCCACCAT GGTGGCTCCT CCCACCTCCT CTGGATTTGT TCACTCTGAG ATCTGTTTGC 420 

20 AGAGTGGGTG CTTAGCAGAC AGAGTGAAGC TGGCTGGGGG GCACAGTGGT GTGTAGTGCT 430 

GCTGTGTATC AAAAGACCAA GGTATTATGG GACCTGGTTT CAGAATGGGA TGGGTTTCTT 540 

CACCTCATGT TAAGAGAAGG GAGTGTGTCC TGAAGAAGCC CTTCTTCTGA TGTTAAAATG 600 


CTGACCAGAA CGCTCTTGAG CCCAGGCATC GTTGAGCATT AACACTCTGT GACAGAGCTG 660 

CAGACCCCTG CCTTGAGTCT CATCTCAGCA ATGCTGCCAC CCTCTTGTCT TTCAGAGTTG 720 

30 TTAGTTTACT CCATTCTTTG TGACACGAGT CAAGTGGCTC ACAACCTCCT CAGGGOjC-CA 730 

GAGGACTCAC TCACTGGTTG CTGTGATGAT ATCCAGTGTC CCTCTGCCCC CTTCCATCCC 340 

CAAC CACATT TGACTGTAGC ATTGCATCTG TGTCCTGTTG TCATTTATGT TAACCTTCAG 900 


GTATTAAACT TGCTGCATAT CTTGACATAT CTTGAGATTC TGCATGTCTT GTAAAGAGAG 960 

GGGATGTGCA TTTGTGTGTG ATGTTGGATA GTCATCCACG CTCAGTTTGG ACCATTGGAG 1020 

40 GAACTTAGTG TCACGCACAA ATGGGGCTAT TCCTACGCTT AGAATAGGGC TTGTCTGCCC 1030 

ACTTTAGAAG AGTCCCAGGT TGGTGAGCAT TTAGAGGGAA GCAGGGCAGA ACTCTGAACG 1140. 

ACAATACGTC TCTCTGAGCA GAGACCCCTT TGTTCTTGTT ATCCACCCAT ATGGACTTGG 1200 


AATCAATCTT GGCAAATATT TGGAGAGATT GTGTGGATTT AAGAGACCTG GATTTTT AT A ' 1250 

TTTTACCAGT AAATAAAAGT TTTCATTGAT ATCTGTCCTT GAAAAAAAAA AAAAAAAAAA 13 2 Q 

AAACTCGA 1328



(2) INFORMATION FOR SEQ ID NO: 52:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1856 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double



WO 98/54963 



PCT/US98/11422 



511 



(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:



GAAJTTCGGCA CGAGCTCTGC A-jCATT 
CCTAGATTAA ATTCCCC1 



TGCTGTCTTC AATTAA.ACCA TT*: 




CTGA.-_-.rrCL-- GTT 



TTTCCAGGCC TTCCTTCTTT GTACAAAA 
TTCA.AACATG ATGrr.-_-.TTT 
TTTGCCACTG TTArTAGTTC 
TTTGATTATC TTTCTATCTC TTTTAT 



A-ArGrrrAr aaagc gtttc •actta.ta.ttc 
err c ctagga rArrrrATTA. ttcctatgat 





;tctt tcttgcagtt 

TAGTTTGATA GATCTGCAAG CTA.TGCTGCT TCa-.rGAA.GT TAGCTGCGGT GGTAGGAA.CG 

cajggcttctt tgtgtctggt tgtagcttgt ATGArcGcrr ca— ggcag acaacgtagc 

CGGAGATCAC AAATGAGGCC CTTGGT CTAG TTGCTAGTGT GTGGAGGTGC PC-AGAGGTTG 
GCAGAAACTG ACCTCACTGG GCAAGGGTGG CeATGC-ACCt GArTCTTTAA. TGCACTCTAT 
GTGTTCAGGA AGC GAC AGGC CATATTTGAC TCCGAGAAAG AAA--IAA.G.-G G-AAAACCCC 
ACAAAGTATA ACAACCCCTT AAGATA. CATC TATTTTAAAG TGAAATTAA.T TTTTCAGTTT 
ATACCATTGG CC-ATTACAA -G-.rAA-AA.rG TTCAATTTrr rT.A--GAATGC TTTGTTGACT 
TGTCTTTTCA TCTCTTGCTA riTATArrTG TCA_TG_TAC- TG.A-.CAAAjGT CTTATTTGCT 
GAGGAAGGAC TTTGCTGCAC TT.ACTGTAG T AC-.rG.-AA.CA CTGGGG-.GGG TGGTGTTTAA 
CTTTTTAAAA AATGITATTC TGATTATAAJT A-.rAATATCC- GTTTTTTTCA TGAAAA.G-.GC 
GCCACCTTGC AAGGTTTAGT GAGA.TTTATG G-A.CTTGAA: Ar~ A.GCAG GAATTGCTGC 
TAC<rrCCAAA AA.TTTGCGAA GCAAAAGCTA GC CCCAATT' C 3~TC-GAAGT TTGAAACTGA 
TTAACAGATT TGCATTTGAA GTGACTCTAG ACATTAGGrr GAGACA.TTAG TTAAAAATAG 
AAAGAGGAAT AAAGACATCT YTTCTCrGTA GAAAAGATAA G-GCr.pAm AATAATCCTT 
CCCACTTTGA TTG.-GATCAG CTTGTCTGAr AACGTGATAT GAGTGTGACA ATGATAAACA 
TGATAATAGT GvTACTTTTG TAATTTTGCT GGTC^LATTTA AGA-GATA.GT .vAKGA.TG.AG' 
TTCAYCTTTT CTiCGAAGAT YCCTACV C C*T AG.ATG-AGrr TAG TT CAAA.T TGGGAATTAT 
AACTGTCCTA ATTTTTGTTG TGTACCGTGA TGCCCCCTTr GCTTTAATAC CCACAGTGTA 
ACAATTAAAT ATCACACTAT GACATATGAT TTAAGTAC-G-. TATTTTAAAG ATAAATTTTA 



GGGGTAAATG TTTACTTCAA AATGACTCCA TATTTCAAAT ATCTGTTTAG ACTGTGAAGG 1740

CCAAATAATT TTTAAGAAAA CATTTGAAGA GTAGTGTGTT TGCATTTGTG AATAATGTTA 1800

CTCACAGCAA GTAAACGTAA TAAAAGCCAA CATTTAAGCC AAAAAAAAAA AAAAAA 1856

(2) INFORMATION FOR SEQ ID NO: 53:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1558 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:

TGGGTATCCA TTCCTGNAAT TACTTTACTT AGGATAATGG CCTCCAGCTC CGTCCAAGTT 60

GCTGCAAAAG GTATTATTTC GTTCCTTTTT GTGGCTGAGT AGTATTCCAT GGTGTATATA 120

TACCACATTT TCTTTATCCA CTCATTGCTT GATGGGCAGT TAGGTTGGTT CCACATCTTT 180

GCAATTGTGA GTTGTGCTGC TCCAGATATC ATCTTTAACT CCTTTGCCTT CTCCACATAC 240

ATTTCCAAGT CCTGTTCATT CTACCTCCAA AATGTATCTT GTATCCATTC ATCTCTCTCC 300

ATCTTCAATC TATTTCAATG CCCCATCATC TCTTGCATGG AGGAGTGTAA TAATTGGCTA 360

ACTGGCCTGT TCTTACATTT TAAAATCAAA AGATGTGACA GGTGAAATGC CTATTTCAGT 420

GTCCATTGAT GGTTCTGCTT ACACACCACC TGGCTGCCTG GTGTCGCAGT GGCAGAGTTG 480

AGCAGTGTGA AAAAGACTGC TTGGCCCTTT ACAGGGAAAG CAGGTCCACT GTGGCCTGTG 540

AGGACGAGAG CTCTGGGCAG GCTCGGACAC TGGCAGACCC TGGTCCTGGC TGGCCAAGGC 600

AGCAGGGTAT GTGTTTCGGG TCACTCACAG GGCTCAGCAC CACTCCTCAT GGCTTCCTTA 660

CTGTTTCGGC AGAGGCTGAC CCGCGGCTGA TTGAGTCCCT CTCCCAGATG CTGTCCATGG 720

GCTTCTCTGA TGAAGGCGGC TGGCTCACCA GGCTCCTGCA GACCAAGAAC TATGACATCG 780

GAGCGGCTCT GGACACCATC CAGTATTCAA AGCATCCCCC GCCGTTGTGA CCACTTTTGC 840

CCACCTCTTC TGCGTGCCCC TCTTCTGTCT CATAGTTGTG TTAAGCTTGC GTAGAATTGC 900

AGGTCTCTGT AGGGGCCAGT TTCTCTGCCT TCTTCCAGGA TCAGGGGTTA GGGTGCAAGA 960

ACCCATTTAG GGCAGCAAAA CAAGTGACAT GAAGGGAGGG TCCCTGTGTC TGTGTGTGCT 1020

GATGTTTCCT GGGTGCCCTG GCTCCTTGCA GCAGGGCTGG GCCTGCGAGA CCCAAGGCTC 1080

ACTGCAGCGC GCTCCTGACC CCTCCCTGCA GGGGCTACGT TAGCAGCCCA GCACATAGCT 1140

TGCCTAATGG CTTTCACTTT CTCTTTTGTT TTAAATGACT CATAGGTCCC TGACATTTAG 1200

TTGATTATTT TCTGCTACAG ACCTGGTACA CTCTGATTTT AGATAAAGTA AGCCTAGGTG 1260
TTGTCAGCAG GCAGGCTGGG GAGGCCAGTG TTGTGGGCTT CCTGCTGGGA CTGAGAAGGC
TCACGAAGGG CATCCGCAAT GTTGGTTTCA CTGAGAGCTG CCTCCTGGTC TCTTCACCAC
TGTAGTTCTC TGATTTGCAA ACCATCAGCT GCTTTTAAAA TAAGATCTCT TTGTAGCCAT
CGTGTTAAAT TTGTAAACAA TCTAATTAAA TGGCATCAGC ACTTTAACCA A-AAAAAAAA
AAAAAAAAAA AAANAAAAAA AAAAGGGGGG CGGTGTAGAG GTCCAAGTTA MGACGNGG 



(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 948 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:

TAAAAATCAT GCTCTGTACC ATCCTCACCG TAGTCATCAT CATCGCCGCG CAGACCACGA 

GAACTACTGG GATCCCTAAA AACGCCCCTG GTCCGGCCCC ACTCTGCGCC CCTCGATCTC 

CCAGGCTCTT TCTGCAGWCA TACCGCGGAC CCAATGGGCG CCCTGCACAC CCGTTTCTGG 

GGCCGTCAGA CTTGGATACA TCGTAAACTC CGCCTCCACG GAACGTCTCG CCTKGCGAGC 

AAGMTCGGAA TCCAGTTCCT CAGGAACCCC TCCAAAACCC ACACCCCCAG GGACGCCGCT 

TTCCGGGATC CCGGSCAAAC GCCGGACCCT CAGTCGCTCC AGGCCCCCTC ACCCTCAAAG

TGTAGCGCCC CCAACCGAGC AACCTCGGTT TGGTCCCTAA AACCCCGCCT CCTCTATAAG

CACCGCCCCA GCTCTGACAA AACCCCGCCT CCAGGTCGGC AGGCTCCGCT TCTTTTCTTC

TCCGCGGGGT GATTCAGTCC AGTGATTGGG TTTGTGGCTC CAGGCCTCGC CCACAGACGG

ACAGACCCCT CCCTTTCTTC CGGCAAAAGG ACCGAGCCCT GGGGTAGTAA GGSCCCCACA 

CTCCTGTTTT TTGCAAGTAC ATTTTTGTCC YTCCTCCACC CAGGTATCTG CCTATTTTCT 

TGCTAATCCC AGAACCTTTC CTTTTGCTTT TTTTAAGGAC ATTTGGGAAG TTCCTGGTGT 

AGGACCCTTC TCCCTGGGAT AAGAAACCTG CCTGTAAACG CTCTGTAAAT ACTCCCTTCC 

ACCCATCCCA GCCCCTGGGC AGCCGGGCAG AAGGGAATCC AGGCTATGGA CCTCCCAAGT

CCCCGCTCCC CGCTCCCCTC GGCGGCCCCG CCTTGTTCTG ATCTGTGTGT GAGTGTGTGT 

GAACTTCTGA AAGACAATAT TAAAGAGACT TAGTTGAAAA AAAAAAAA



(2) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 990 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:

GGGGAACTGC AGTGACAGCA GGAGTAAGAG TGGGAGGCAG GACAGAGCTG GGACAOjGGT 60

ATGGAGAGGG GGTTCAGCGA GCCTAGAGAG GGCAGACTAT CAGGGTGCCG GCGGTGAGAA 120

TCCAGGGAGA GGAGCGGAAA CAGAAGAGGG GCAGAAGACC GGGGCACTTG TGGGTTGCAG 180

AGCCCCTCAG CCATGTTGGG ACCCAACCCA CACTGGCTAC CAGGTCCCCT ACACAGTCCC 240

GGGCTGCCCT TGGTTCTGGT GCTTCTGGCC CTGGGGGCCG GGTGGGCCCA GGAGGGGTCA 300

GAGCCCGTCC TGCTGGAGGG GGAGTGCCTG GTGGTCTGTG AGCCTGGCCG AGCTGCTGCA 360

GGGGGGCCCG GGGGAGCAGC CCTGGGAGAG GCACCCCCTG GGCGAGTGGC ATTTGYTGCG 420

GTCCGAAGCC ACCACCATGA GCCAGCAGGG GAAACCGGCA ATGGCACCAG TGGGGCCATC 480

TACTTCGACC AGGTCCTGGT GAACGAGGGC GGTGGCTTTG ACCGGGCCTC TGGCTCCTTC 540

GTAGCCCCTG TCCGGGGTGT CTACAGCTTC CGGTTCCATG TGGTGAAGGT GTACAACCGC 600

CAAACTGTCC AGGTGAGCCT GATGCTGAAC ACGTGGCCTG TCATCTCAGC CTTTGCCAAT 660

GATCCTGACG TGACCCGGGA GGCAGCCACC AGCTCTGTGC TACTGCCCTT GGACCCTGGG 720

GACCGAGTGT CTCTGCGCCT GCGTCGGGGG MAATCTACTG GGTGGTTGGA AATACTCAAG 780

TTTCTCTGGC TTCCTCATCT TCCCTCTCTG AAGGACCCAA GTCTTTCAAG CACAAGAATC 840

CAGCCCCTGA CAACTTTCTT CTGCCCTCTC TTGCCCCAMA AACAGCAHAA GCAGGAMAJMA 900

NACTCCCTCT GGCTCCTATC CCACCTCTTT GCATGGGAAC CTGTGCCAAA CACOCAAGTT 960

TAAGAAAAAA ATAAAACTGT GGCATCTCCA 990

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1603 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:

GGTCGACCCA CGCGTCCGGC CCGCCGGCTC CGGAGCGGCT CTGCCTTCCC GAGCGCGGGA 60
CCGCGCCCTG GGGGAGGAGG GCGAACGACG CGGCGATGGC TCCGCGGGCA CTCCCGGGGT 120
CCGCCGTCCT AGCCGCTGCT GTCTTCGTGG GAGGCGCCGT GAGTTCGGGG CTGGTGGCTC

CGGACAATGG GAGCAGCCGC ACATTGCACT CCAGAACAGA GACGACCCGG TCGGGGAGCA 
ACGATACTGG GAATGGACAC CCAGAATATA TTGCATACGC GCTTGTCCCT GTGTTCTTTA 
TCATGGGTCT CTTTGGCGTC CTCATTTNGC CAMCTNGCTT NAAGAAGAAA GGCTATCGTT 
GTACAAGAGA AGCAGAGCAA GATATCGAAG AAGAAAAAGG TTGAAAAGWT AGRATTGAAT
GACAGTGTGA ATGAAAACAG TGACACTGTT GGGCAAATCG TCCACTACAT CATGAAAAAT 
GAAGCGAATG CIGATGTYTT AAAGGCGATG GTAGCAGATA ACAGCGTGTA TGATCCTGAA 
AGCGCCGTGA CCCCCAGCAC ACCAGGGAGC CCGGCAGTGA GTCGTGGGCT TTGTCACCAG 
GGGGGACGCC AGGGAAGCAC GTCTGTGGCC ATCATCTGCA TACGGTGGGC GGTGTWGTCC 
AGAGGGATGT GTGTCATCGG TGTAGGCACA AGCGGTGGCA GTTTATAAAG CGCACTAACA 
AGTCCAGAGA GAGCAGACCA CGGCGCCAAG GGGAGGTCAC GGTCCTTTCT GTTGGCAGAT
TTAGAGTMAC AAAAGTGGAG CACAAGTCAA ACGAGAAGGA ACGGAGAAGC CTGATGTCTG 
TTAGTGGGGC TGAAACCGTC A-.TGGGGAGG TGCCGGCAAC ACCTGTGAAG AGAGAACGCA
GTGGCACAGA GTAGCAGGTG AGGCGTGGTT TTGGTGAGAT TGGGGGCAGA GTGGTGCAGG
GTGAGGAGAA GGTACTTGGA GCCTCCCAGG TGCTGTGGCA GCATAGGAAT GGTATTTGAG
AGGGAAGTGG GAGAGCTTTG CTTGACCCAG GAAGACTGAG GGGGACTGAA CATGATTACT 
TGTCTGCCTA GAGGTTCTTG TAAAGAAGTC ACAAACTTAG TGCCTCCAGG GGCTTGGCTG 
TGTGATAATG AGGATAGAGG ATTACTTGTG AGGCAATGTG GGATGGTGGG GATTGTGGCA
AAGTAGAATT CACATCACCC ACCATATAGG GCTTGGATTA CCACGAGGCA GAAAGCACCT 
AGTGTTGGTG CATCTTCTTA CGGAAAAAAG ACAAAATCCA GACTTGTAAA ATGTAAAATG 
ACTGATTTTC GATATTGGCA GCTTACTTTT TTTTTTTAAA CAACCATGCA GGCCAAATGA
CTTGTAATCT TGTCACCATT TTTAGGTAAA GTGTGACTTG AAAAAGTCTG GAGCAAACAA 
ACCAATGCTT TTTCCTTTTA TTCTGTTGGR AACCAGTTTT CTTTGTGTCA CAGTTYTGAA 
ACCTCAATAC GAATATTTCT CTTCGCACCA AATATTTTGA GGGAATTGAA AAGGCACAGT 
GATTTATTTC TTGATTTGGG AATTTTAATT TTGGAAGACA ATT

(2) INFORMATION FOR SEQ ID NO: 57:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1052 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:

TACAGCTCAG GATGCCTGTA ACATTGTCAT CTCTGGGCTT CTGGGTCCTG CTTAGCCTGC 60

TTTTTCCCTG GAGGACTGAC CAGGGATGCG GCCCAGCAAC ATGTTACTAA ATCATACTCT 120

CCTCCCTACC TTTCCCAGAC CTCTCACTCC TGCCTGGTGT TCCAACCCGT TCTGTGGCCA 180

GAGTATACAT TTTGGAACCT CTTCGAGGCC ATCCTGCAGT TCCAGATGAA CCATAGCGTG 240

CTTCAGCAGM AAGGCCCGAG ACATGTATGC AGAGGAGCGG AAGAGGCAGC AGCTGGAGAG 300

GGACCAGGCT ACAGTGACAG AGCAGCTGCT GCGAGAGGGG CTCCAAGCCA GTGGGGACGC 360

CCAGCTCCGA AGGACACGCT TGCACAAACT CTCGGCCAGA CGGGAAGAGC GAGTCCAAGG 420

CTTCCTGCAG GCCTTGGAAC TCAAGCGAGC TGACTGGCTG GCCCGTCTGG GCACTGCATC 480

AGCCTGAATG AGGCTGGCCA CCTGCCACTT TGCCCTGCCC TCTGCCTCCA GGGCTCCMCX 540

MYCCTTCCTT TTCTTGGTGA AAGGCACCTC CTTTCCTGAT AATGAATGGT GTTCCCTTTG 600

CTTGGCTGGG GAGCCCCCCA GGCGAGGTTT GCTGGCCATA GATACCTTTG GGCTGCCTGR 660

GACAGGCTCC TGAGGAGGAT TGAGGGTGAA AGTCTCCCAC GAGTACACTA AACCTAGGTC 720

TGGTCACCAA TAGGGTTTGG AGAGCAAAGG GCCACAACTC ATCAGCTGCC TGTCTCTTAG 780

ATGCACTTTC TTTTTCCACC AGCACATCCT TCAACACACA GAATTTCAGG GAAGAGTTCT 840

CCCCAAAACC CTAGCTCTTT ACCCTTCCAT TTTAGCCTTC CACCCAGCTT CCAjZAAAAGA 900

TTTGGCTCTA CCTTGGATCT GCTAGTAAAT AACTAATAGG CAGGCAGTTA TTTGCGTAAG 960

GAAAAAAGGG GTGGGAGAGA CAGAAAATTT GCCCACTGCT GCTCCTCCCC TTGGSTYTCC 1020

ACCTGGGATT TGCTATTGAA TCTCTACCCT NN 1052

(2) INFORMATION FOR SEQ ID NO: 58:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 814 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:
ACNCGNTGGC GGCCGCTCTA GAACTAGGGG ANCCCCCGGG CTGCAGGAAT TCGGCACGAG 60
CATAGACTTT TAAACTGGTA CGGTTCTTAG AGATGGTCCT TGGCCTTCTG TTGTTGTTGT 120
KGTTTTTTTC TTTTTCTTCT TCTCCTTCTC CTTCTTCTTC TCTTCTCCTT CTTTCTTCTT 180
TTTTTTTTCA GAGTCTTGCT CTGTCACCAA GACTGGAGTG AAGTGATGTG ATCTCGGCTT 240
ACTGCAACCT GGGAGGGAGA GGTTGCAGTG AGTCGAGATG GTGCCATTGC TCTCGTTTGG
GOA.OAGAG TGAAACTCTT G7CTCAAAAA AAAAAAAAAA ATGAGGTTTA AGACAGTTTT 
GTCATTACTG GTGGGATCTG GTCACACAAG ATAGCATTAA ACGTGACATG GCACATAAAA 
TTGGTTAAAA AATTTTGTTT TTTAATTACG TAATGTAAAA GCCCAACAAA C*CTTTATGC 
AAGATTGGAA TGTATCTTCA AATTCAGATT TAATAAACAT GTAAAGATCC TCTGTATATA 
AAAGTTGTAT TTAATCCCTT GTGCCCCAAG AATGCTATAA AAGATCCCAA GAATGTTATC 
TATGAAAAGA TAGCAATAGG GAATGGTGAA CAAATAATTT AATTTGCCAA TTCTAAAAAA 
CATGGACTTA AACGCCATGA AAACTTGGTT CCATAGTTTT AACTGTTTTA TGGTTCCAAT 
ACAAAACCAG AGTGGTTTAC ATTCCACAAT NACCAAATTT GGATCCAATN TTGGGGTAAT
TTTNGGTATT TGCCATGGGA TACTATTCAT TTTT

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1215 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

AGAGGAAGTC TTTTGCCAAG C'CTGTTCTCT GGACT AAC GO CATCCAGGCT GGGAGGGGAA 

GAGTGCTCTG CTACACTCGT CCCCCTCCTG CCTCATCTTC CTTCTCAGCC TTGGTTCCTG 

ATGGGAACAG AATGGAGGGC CTGAGAACAT ACTTTCTAAA TGCCTTTGAC CCAGGAACCG

ATTATCTATA TTTGTTCCCA TTTTCCTTCA CCGTGACATT CCAGCATTGT CTGACTGTGA

GGTGGGCCTT TCAGAGCCTC CAGGTTCCTC AAAACAGGCC TGAGCGATGG GCATCACACC 

CTCTGCCTAC CCACRTGCCT GCTTACCTGC CAGATAACCA AGTGMAGATG TCTGCGAGTG 

GCTAGTTTTC AO,TTCTTAC TAGTGTTTGG VTCACCTTTG GGCAAAGGCC CCCTCTAGGC 

CTTGCCCCAC CTCCATCAAA CGCAGACACT GTAGTCAGAC CTCAGYAATA TAGGAGGCAA 

TAATCTTTTA ACAGTGTTTT GCAAAGAAAC AAAAAGAGAA AAATCCCAGC CAGGGGAACT 

CGCCACCTGC CCACGCTAGT TCCATCCACG CTCAAGACCC GCCCTTAGAC CAGGCAGGCA 

AAGGCCCCCA TCACACTCGG CCACTAGTGG GGTGCTGAGG CCAAGAAAGA AACCAGACCC 

TGTATGACAA GTTGGGKTCT TTCCAGAACA CGACAGAAAC CCTTGTTAAT 

GCCACTCCAT ACTCCAGAAG CATTATTCCT TATTTGGGAC AGCCAAGGGC AGATTCACAG 

GTTATTGTAG GAATAAAGAC TAGTTTACAA AGGARAAAGA GSCCCTGGAC TTCCCMAGGA
AAGGTCAGGT TAGGGCTCCT GTACCCATTC TGTTCCACCA CTGTTTGATC TCTCTGGCCT 900

CCCACCAGGA ATGCCGTTTC CTTTTTATGG ATCTGTTGGG AACCAGAGAG AATCAACAGA 960

TCAATGACAT AGGATCCGAA GTGCAATGAT AGTCACTTCT AGTTTGGCAT TTCACAAACT 1020

CTCNACAGCA AGGTATTGGT AGGTTACTCA ATTTCAAAAG GGCCCCATGG CCAAATAXGT 1080

TTAGGAACCG CTGTTTGMAT TTCTTTTTTT GGAGACGCAT TGTATATAAT ATATGTCAAA 1140

GGCTTTCGGA ATTCCTGCAG GAAAGAAATC AGCTTTGTTA AATCCNAAAA AAAAAAAAAA 1200

AAAAAAATAG ACTCG 1215



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 478 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:

ATTTCTTATG ACATGGGGGT TTGAATTGGT TGGCAAATGT TTAATTTTAA TATCCATAAT 60

CAGTGAGGTC CTGCTGGCTG TAATCATTAA TTGTGAAATC TAAGGAGCTT AGTTCATGGC 120

TCTAGAATTT CACAGAAAAR TGYGMTATGA TACGAGCATT AAGTTTATTT CTTCTGATCT 180

TTGATGCAGC TTTGTTGAGT TTATCTGTTT TTGTATTTAT TGGTCATCTA CTTCCCATGC 240

CAAAAGGGAC TGGTCTACAT AGCTGCGCTA AACACCTGAT CAAATCACTA AAAGAAAATG 300

TGTTACCTCT AATGAATTAT CCTGATTGTA AGTTAAAAAT CAATATTTCC CCGTAGTGAG 360

GTTTGCTTTT TAAAAAGAAK KCTTAAAAAA AAAAAAAAAA AAACGAGTTN AAGAAAAGGA 420

AGCAAGCTCA GGTAAGGTGC ACACATTGGG CTAAGGAAGC TAGAGCCTGT GGAGANGC 478



(2) INFORMATION FOR SEQ ID NO: 61:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 618 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:
TATGACCTTG ATAACCCCAA GTTNGAAATT AACCTTCANT AAAGGGAACA AAAGCTGGAG 60 
TTCGCGCGCT TGCAGTTCGA CACTAGTGGA TCCCAAAGAA TTCGGCACGA GTCATAATGA 120
GCTACTAGGT AAGCCTTCTG GGACTTTCAG ATATTTTGGG GAAGATTGAT TTTTGTTCTT 180

ACATGCTGTG GACCCTTGGC CATCAAATGG TATGGGGAAG CTCATCCGTC TGTCTGTGAT 240

GGTCATGTCA GTCAGGCGTC TTTTTAGTAT TTACTGGGTG CTCAGTACTG TGCCAGATGC 300

TGTCGGGAGC CGTGGTGGTA TGGAGGAGGA GTGCTCCAGA GGACTCTGGT GTGTGGCAGG 360

CCAGCATAAA CAAGCCAAGG GGAAAAGGCA GGCATGGAAT AAAGGGGGAG AATACCAGTG 420

TGTGAGTTAC TGCTGACTGT GTGGATTAGC CTATCAGCAG TAATCAAGCA GGGCGGAGGG 480

CATTATCTTT GAGCCAGAAG AGTGAGCACT GG3CCGAGGG TGGAGCATCA AGAGGGGGTG 540

TAGGACCMCA AGGCTTCTTN CNGGGGAGAC AACGTCAATA AGCNGTCAGT AGTCACGGAC 600

AGTTTTGGGA AGCAAGGG 618



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 751 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:
TCGACCCACG CGTCCGAGGA GCTGGACTTC TGAGACAGCC ATTCTCCTTG CATAGCACTG 60 
TCTGCTGCTA CAGCTCATAG AA3TCAACAA TTTTCTTCAA CACTGGTAGG CAGCCTCTAA 120
ATGGCCCTGA TCACCCTCAC CTCCTGCCAT TCACAGCNNT GTAAAATTCC ACCCCTGGAC 180
CTAGTGACTC ACTTCTAACA ANGAGAATAC AGCAAAAGTA ACATCGCTTC TGAGGTGAGG 240

CTACAAGGAG ACTACGATGC CTGCCTTGGT CACCCTTCTC CTGCTCTTTC CATTGCTCCC 300
TCTGATGGAA GCCAGTTGCC ATGTGATGAG GTGCCCTATG GAGAGGCCCA CGTGACAAGG 360
TATTGTAAAA AGCCTCTGAC CAATAGCCAT CTAGAAACGG AGGCCCAGTC CAGCAGCCTC 420

TGAGATGAAT CCTGCCAACC TGAGCTTGGA GACAGATTCT CTCCCTATCC TGCCTTGGGA 480
TGATCACAGC CACCACCAAC ACCTTCACTG CCTGGTGAGA GGCCAAGCCA GTGAACCCAA 540

GGTAAACTGG ACAGAATCCT GACCCACAGA AACTGAGATA ATGTTTGTTA TTTTAAGCTG 600

CTCAGTTTGT TACAGAGCAA TAGATAACTA ACTCAAACAC CATAAAATTC TAATATTTTA 660

TTCTATCACA CAAACCAGGT AATACCAAGT AAATGCCATT ACTATACACA TATTTTTGTA 720

ACACAr.TTAC ATGTGATTTT TTAAGAAGGC T 751 
(2) INFORMATION FOR SEQ ID NO: 63:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 780 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

c^ic^gtca c:gtccccga ttcccgggtc gacccaggcg tccgsgttgg caactcctga 60

GCCCTGCACG C-STG.-.CTTC-. CATTTTCCTA CCTCTCCTTC T^TCTCTTC' fAGAGCACCT 120

GCTATCCCCA ACTTCTAGAC C^GCTCC-AA CTAGTGACTA GGATAGAATT TGATCCGCTA 180

ACTCVTIGTC TGCCGTGCTC ATTGCTGCTA AC-GCATTGC CTGTGCTCTC CTCTCAGGGG 240

CAGCAZGCTA ACGGGGCGAC GTCCTAATCC AAGTGGGAGA AGCCTCAGTG GTGGAATTCC 300

AGGCAGTGTO ACTGTCAAGC TGGCAAGGGC CAGGATTGGG GGAATGGAGC TGGGGCTTAG 360

CTGGGAGGTG GTCTGAAGCA GACAGGGAAT GGGAGAGGAG GATGGGAAGT AGACAGTGGC 420

TCG7ATCGCT CTGAGGCTCC CTGGGGCCTG CTCAAGCTOr TCCTGCTCCT TGCTGTTTTC 480

TGATGATT7G G3GGC7TGGG AGTCCCTTTG TCCTCATCTG AGACTGAAAT GTGGGGATCC 540

AGGATGGrCT TCCTTCCTCT TACCCTTCCT CCCTCAGCCT GCAACCTCTA TCCTGGAACC 600

TG7CGTCCCT TTCTCCCCAA CTATGGAiCT GTTGTCTGCT CCTCTGCAAA GGCCAGCCAG 660

CTTGGGAGCA GGAG-.GAAAT AAr.CAGCATT TCTGATGCCA AAAAAAAAAA AAAAAAAACC 720

GCGG2CGA-A. GrtTATTMCC CTTTAAGTAA GGGGTTAATT TTTAGCTTGG jGCACTNGGCC 780

(2) INFORMATION FOR SEQ ID NO: 64:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 588 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:

TTCCGAATTA ATCGAGTCAC TATAGGAAivT GCCGTCGCCA TGACCCGCGG TAACCAGCGT 60

GAGCECGCCC GCC-GAAGAA TATGAAAAAG CAG?£CGACT CGGTTAAGGC AAAGCGCCGA 120

GATGACGGGC TTTCTGCTGC CGCCCGCAAG CAGAGGGACT CGGAGATCAT GCAGCAGAAG 180

CAGAAAAAGG CAAACGAGAA GAAGGAGGAA CCCAAGTAGC TTTGTGGCTT CGTGTCCAAC 240

CCTCnGCCC TTCGCCTGTG TGCCTGGAGC CAGTCCCACC ACGCTCGCGT TTCCTCCTGT 300

AGTGCTCACA GGTCCCAGCA CCGATGGCAT TCCCTTTCCC CTGAGTCTGC AGCGGGTCCC
TTTTGTGCTT CCTTCCCCTC AGGTAGCGTC TCTCCCCCTG GGCCACTCCC GGGGGTGAGG 
GGGTTACCCC TTCCCAGTGT TTTTTATTGC TGTGGGGCTC ACCCCAAAGT ATTAAAAGTA 
GCTTTGTAAT TCCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAANNCGGG GGGGGGCCCC CCCCCCCC

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 774 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:

TTTAAAGATG AAGAAATGAC AAGGGAGGGA GATGAGATGG AAAGGTGTTT GGAAGAGATA 

AGGGGTCTRA GAAAGAAATT TAGGGCTCTG C^TTCTAACC ATAGGCATTC TCGGGACCGT

CCTTATCCCA TTTAATTAAT TTCTCTGACA ATTCAATTAT TTTCTGTTAT TAATGTTGCC

ACTGCTTTCT GTTTGTCTGC ACTTTCTTGA TAAATATTTG CTATCGTTTT ACTCCAGTCA

TTCGATGTTG CTGAGATTTA CATATGACTC TTGTCAACAT CTCATCTTTT GACCCAATCT

TATTCATTTA ATAAGAGGTC TCATTCATTT GCATGGAAAA ATGCTCATTG TATATTGCAA

AGTGAAAATA ACGAJGTTGCA AAACAGTGTA TACATATATG TGTGTATATA TGTACACTTT

ATTTGTACAT TTCTATGTGA CATAATGCAA AGGAAAGTGT CTGATTTTAT TATACACCAA

AGGTTAACAG TGAATCTCTG TGTGATCTCT TTTTTTTTCT TTTTGGCTAT CTGCATCTTC

TCACTTGCCA AAAAATGAAT ATATGTTTAT GTGTGTATAT TACTTGTGTC ACA1AAAAACC 

CTAAAGTAGA CAGTAAAAGA ACTTGTCAAT CGCCTTTGGA AGGCAATGAA ACACTTAATA 

AACTCTCAAT AACAGAAGCG TAAAAATGAA ATGTAAACCT CCAATTACCT CTGGATCTCT 

TAGCCAGAGT AATAAACTGG TAATTATTAC AGATAAAAAA AAAAAAAAAA AAMA 

(2) INFORMATION FOR SEQ ID NO: 66:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1866 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:

ACCCACGCGT CCGGTCCTCT TCTTCAGCAC ATGCCAAAGC TGTTCCTCAC CGCCTGTGAG 60

AjCAAjGAGCAT CTTGGATGTA GGACAATGGA AGAGTTAGAT GCCTTATTGG AGGAACTGGA 120

ACGCTCCACC CTTCACGACA GTGATGAATA TTCCAACCCA GCTCCTCTTC CCCTGGATCA 180

GCATTCCAGA AAGGAGACTA ACCTTGATGA GACTTCGGAG ATCCTTTCTA TTCAGGATAA 240

CACAAGTCCC TTGCCGGCGC AMTCGTGTAT ACTACCAATA TCCAGGAGCT CAATGTCTAC 300

AGTGAAGCCC AAGAGCCAAA GGAATCACCA CCACCTTCTA AAACGTCAGC AGCTGCTCAG 360

TTGGATGAGC TCATGGCTCA CCTGACTGAG ATGCAGGCCA AGGTTGCAGT GAGAGCAGAT 420

GCTGGCAAGA AGCACTTACC AGACAAGCAG GATCACAAGG CCTCCCTGGA CTCAATGCTT 480

GGGGGTCTSG AGCAGGAATT GCAGGACCTT GGCATTGCCA CAGTGCCCAA GGGCCATTGT 540

GCATCCTGCC AGAAACCGAT TGCTGGGAAG GTGATCCATG CTCTAGGGCA ATCATGGCAT 600

CCTGAGGATT TTGTCTGTAC TCATTGGAAA GAAGAGATTG GCTCCAGTCC CTTCTTTGAG 660

CGGAGTGGCT TGGNCTACTG CCCCAACGAC TACCACCAAC TTTTTTCTCC ACGCTGTGCT 720

TACTGCGCTG CTCCCAJTCCT GGATAAAGTG CTGACAGCAA TGAACCAGAC CTGGCACCCA 780

GAGCACTTCT TCTGCTCTCA CTGCGGAGAG GTGTTTGGTG CAGAAGGCTT TCATGAGAAG 840

GACAAGAAGC CATATTGCCG AAAGGATTTC TTAGCCATGT TCTCACCCAA GTGTGGTGGC 900

TGCAATCGCC CAGTGTTGGA AAACTACCTT TCAGCCATGG ACACTGTCTG GCACCCAGAG 960

TGCTTTGTTT GTGGGGACTG CTTCACCAGT TTTTCTACTG GCTCCTTCTT TGAACTGGAT 1020

GGACGTCCAT TCTGTGAGCT CCATTACCAT CACCGCCGGG GAACGCTCTG CCATGGGTGT 1080

GGGCAGCCCA TCACTGGCCG TTGTATCAGT GCGATGGGGT ACAAGTTCCA TCCTGAGCAC 1140

TTTGTGTGTG CTTTCTGCCT GACACAGTTG TCGAAGGGCA TTTTCAGGGA GCAGAATGAC 1200

AACACCTATT GTCAACCTTG CTTCAATAAG CTCTTCCCAC TGTAATGCCA ACTGATCCAT 1260

AGCCTCTTCA GATTCCTTAT AAAATTTAAA CCAAGAGAGG AGAGGAAAGG GTAAATTTTC 1320

TGTTACTGAC CTTCTGCTTA ATAGTCTTAT AGAAAAAGGA AAGGTGATGA GCAAATAAAG 1380

GAACTTCTAG ACTTTACATG ACTAGGCTGA TAATCTTATT TTTTAGGCTT CTATACAGTT 1440

AATTCTATAA ATTCTCTTTC TCCCTCTCTT CTCCAATCAA GCACTTGGAG TTAGATCTAG 1500

GTCCTTCTAT CTCGTCCCTC TACAGATGTA TTTTCCACTT GCATAATTCA TGCCAACACT 1560

GGTTTTCTTA GGTTTCTCCA TTTTCACCTC TAGTGATGGC CCTACTCATA TCTTCTCTAA 1620

TTTGGTCCTG ATACTTGTTT CTTTTCACGT TTTCCCATTT CCCTGTGGCT CACTGTCTTA 1680

CAATCACTGC TGTGGAATCA TGATACCACT TTTAGCTCTT TGCATCTTCC TTCAGTGTAT 1740

TTTTGTTTTT CAAGAGGAAG TAGATTTTAA CTGGACAACT TTGAGTACTG ACATCATTGA 1800
TAAi^.TAAA-CT GGCTTGTGGT TTCAATAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1860
AAAAAA 1866



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1152 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:

CTCAAGGATG TAAAGGCTCT GCAGATTTCG GGAGGCCTGT CTCCCAGCAC CTGATGGGAC 60 

ACTTTTTGCC CCACTGTAAA TTCTGGGTGT ATCCTCCACT GTATGCTGTC ACCCCAAGGG 120

CAAGCACTGC ATCTGCTTAG TGAAGGATTT ATTGTTCGGA AGATACATTT TCCCCTTKAG 180

CAGAGAGTGG CGTATCCTGG CAGTCTTCGG TGAGCCAGTT GTACCAGGAT TATGAAATGC 240 

AGATGTTTAC TGTGTCATTG TTGCTGTCAT TGCTACTGAG GAGTACTGAC CAGAATCATC 300

TGCAACTYTT AGTTGGCAGA GAGGACCACT ATGGCGGGTA GCTCTTTTCT TTCCTGCCAT 360 

TGTGGGGATG ATTCCAGGCC AAAGATGATG GARAAGTATG GAAATCATCT GAAAGGTTGA 420 

AGCTTGGCAC GTGAAGCCAT TCATGACTTT GTAAGGCAGT TTTGCTGAAG GCCAGTTCTG 480

CCCTGGGAGG GACGGAGGTG AATCCTCCTG AGTACCTGTG GTTTTCTTAC TTCCTGCTGA 540

ATTTACCTAA GTGCCTGTTG TTTGCTTGCT GTGGAGGCTT TCTGGTATTT CATTTCAGGT 600 

GCAGATGCCT TCACTTTCCC ACC3AAAAAA CCCCMACCAA ACCTAAGACC TTACTGCAAC 660

TAAGTYTMCC AAGTACTTTT TAACCCAATG GGATGAACAG CCTGTGGTCT GCTCAGATCA 720 

CCCTGAGTGC GTGTGAGAAG GCMTNGGCTT TGCCAGGAAA TCCAGGAAGG CAGGGCCGGG 780

CTGTGTTGGA AGCTGGCTTA GCTGGTGGGG CAGCCTTATT TCAATTAAAA GGGCATTGAC 840 

TGGGAGCAGC AGTCCTGGAG TTTGTTGCAT TTCCTATTGC CCTCAAAATG AGAAACCAGG 900

AAAATAGCAG ATTGGAGCCT TCGAGAAGGC AGTAAATGGC TGTTTTTATT GACAAAAGGA 960
AAACATTTTA CTGCCATCTC ACTGATGGCA TCTCACTGAC TTAAAATGAA GGCANGTTGT 1020

AGTAAAAAAA AAAGTCTACA TTTTTCCACC GCCACGTTCT TATATCCTGT TTGTCAGCCA 1080

CTGCTCAMAA GGGCATGTTG TCTTGCGGAM TANAGGCGCT CTCCTTCCCT CGTTTTCCCT 1140

ATAGGTTGGG TG 1152
(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2483 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:
AGCAGGCGGT GCGCTGGGGG CGGGAGCAGC GCGKAGCCCG GCTCGGCCAC ACCGATCGCC
CGCCGCCATG GGCTCCTCGC AAAGCGTCGA GATCCCGGGC GGGGGCACCG AGGGCTACCA 
CGTTCTGCGG GTACAAGAAA ATTCCCCAGG ACACAGAGCT GGTTTGGAGC CTTTCTTTGA 
TTTTATTGTT TCTATTAATG GTTCAAGATT AAATAAAGAC AATGAGACTG TTAAGGATCT
GCTGAAASCA AACGTTGAAA AGCCTGTAAA GATGCTTATC TATAGCAGCA AAACATTGGA
ACTGCGAGAG ACCTCAGTCA CACCAAGTAA CCTGTGGGGC GGCCAGGGCT TATTGGGAGT 
GAGCATTCGT TTCTGCAGCT TTGATGGGGC AAATGAAAAT GTTTGGCATG TGCTGGAGGT 
GGAATCAAAT TCTCCTGCAG CACTGGCAGG TCTTAGACCA CACAGTGATT ATATAATTGG
AGCAGATACA GTCATGAATG AGTCTGAAGA TCTATTCAGC CTTATCGAAA CACATGAAGC
AAAACCATTG AAACTGTATG TGTACAACAC AGACACTGAT AACTGTCGAG AAGTGATTAT 
TACACCAAAT TCTGCATGGG GTGGAGAAGG CAGCCTAGGA TGTGGCATTG GATATGGTTA 
TTTGCATCGA ATACCTACAC GCCCATTTGA GGAAGGAAAG AAAATTTCTC TTCCAGGACA
AATGGCTGGT ACACCTATTA CACGTCTTAA AGATGGGTTT ACAGAGGTCC AGCTGTCCTC 
AGTTAATCCC CCGTCTTTGT CACCACCAGG AACTACAGGA ATTGAACAGA GTCTGACTGG
ACTTTCTATT AGCTCAACTC CACCAGCTGT CAGTAGTGTT CTCAGTACAG GTGTACCAAC
AGTACCGTTA TTGCCACCAC AAGTAAACCA GTCCCTCACT TCTGTGCCAC CAATGAATCC
AGCTACTACA TTACCAGGTC TGATGCCTTT ACCAGCAGGA CTGCCCAACC TCCCCAACCT
CAACCTCAAC CTCCCAGCAC CACACATCAT GCCAGGGGTT GGCTTACCAG AACTTGTAAA 
CCCAGGTCTG CCACCTCTTC CTTCCATGCC TCCCCGAAAC TTACCTGGCA TTGCACCTCT
CCCCCTGCCA TCCGAGTTCC TCCCGTCATT CCCCTTGGTT CCAGAGAGCT CTTCTGCAGC 
AAGCTCAGGA GAGCTGCTGT CTTCCCTCCC GCCCACCAGC AACGCACCCT CTGACCCTGC 
CACAACTACT GCAAAGGCAG ACGCTGCCTC CTCACTCACT GTGGATGTGA CGCCCCCCAC 
TGCCAAGGCC CCCACCACCG TTGAGGACAG AGTCGGCGAC TCCACCCCAG TCAGCGAGAA 
GCCTGTTTCT GCGGCTGTGG ATGCCAATGC TTCTGAGTCA CCTTAACTTT GAACCATTCT 
TTGGAATTGG CGTGGTATAT TTAACCACGG GAGCGTGTCT G3AAACGCAA ACTATCATTA 1500

ATTTCATACT AGTTTG7ACC GTATCTGTAG GCATCCTGTA AATAATTCCA AGGGGAAAAC 1560

TAAACGAGGA CGTGGGTTGT ATCCTGCCAG GTTGAGTGGG GCTCACACGC TAGGGTGAGA 1620

TGTCAGAAAG CGCTTGTATT TTAAACAACC AAAAAGAATT GTAAGGGTGG CTTGCTGCCA 1680

GGCTTGCACT GCCGTTCCTG GGGGTGTGCA TCTTCGGGAA AGGTGGTGGC GGGGCGTCOk 1740

CTAGGTTTCC TGTCCCCTGG TGGTCCTTCC GTAAGAAAAT GAAATATTCT ATGCCTAATA 1800
CTCACACGCA ACATTTCTTG TACTTTGTAA GTGGTTTGCG AGAATGCAGA CCACCTCACT 1860

AAACTGTAAA CGGTAAAGAG ATTTTTACTT TTGGTCTCCG TGAGTGGCAT CTCTACTAAG 1920

GTTTACACAG GAATTCCACC TGAAGACTTG TGTTAAAGTT CTACAGGGCG CACTGTTAAC 1980

TGAACGTCTT TTTCTTCAGC CTATACGCGG ATCCTTGTTT TGAGCTCTCA GAATCACTCA 2040

GACAACATTT TGTAACTGGT GCTGTTGCTT TCTACATACA CCTTATAAAG TGACATTTCA 2100

AAAGAAATAA GGTGCCACAG TTTTAAACCA GAAGGTGGCA CTCTGTGGCT GCTTGTAGTA 2160

TTATAGCTAT ACTGGGAAAG CATAGATACA GCAATAAAGT ACAGTAATTT TACTTTTTTT 2220

CTTGTGTTAC ATCTAAATTA CAACCCTTAA TTGCCACGTG TGGACTTACT ACTCTGCAGT 2280

ATGTCTTATT ACTCTGCAGT ATGTCACGCA TCTTTAACTT TTCACGTCCT ATGTTTGCTT 2340

TCTCCCATTT TTAAGAGATO GTAAGTTAAC TGGAATTGAT TTACTGAATG AAATTAAATG 2400

CAGATATCCC TGTTTTTGAA ATAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2460

AAAAAAAAAA AAAAAAAAAA AAA 2483



(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 536 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

GAGAAATGGA GCTTTGTTAG ATAAAAATTT TTTCAACGCA AACAGTCATT TTCCAGTGAA 60 

AGGAGAGCGT ATCCGCCGTA GGATGGACTT AGATCGTGTA AAAGCTGAGG CCACCGAGGA 120 

TATAACCTCC GGGGTCCTTT GCCTCCTTTT CCTTAGACTC CCTCCAAACT CGTGTATCTT 180

TCCTTCAGCA GTACTGGGCT CCAOGCGAAC CTAGTCCTTT GTCTTTACCC TATTACCTTT 240 

CATAACATCC TAGTTGAAAA GTARTTATTC AACCGCGTTT GAAAATGAGA ACXGGTTCAC 300

AGARGCTAGG TTACTTGCGA AGGTCGTTCA ATTAGTAACC AGTAACGCCA GGACTGCCAG 360 
TTTCTTGCTT CCGAATTCTC ATGGTAGCTT TCACCARGCT CCCGGTOSftA TGCTAACGTC
AACTACTGAA CTAJGATTAGC AAAAAGGTCT TTTAACAGAA TTCCTGGTTT TGAGAGAGAG 
TTTCTTTCAT GAAGCGCCCC ATTTCTACAG AGGAAAATAA ACTCCAAGCA GCCAGT 

(2) INFORMATION FOR SEQ ID NO: 70:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 865 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:
CCACGCGTCC GGCCTTTCTT GGCCAGAGGC GCCGGTTGGA CTCACGGGCG GGGCATGATG 
GGTAACAGGA CCGGTGGGGT CCCCAGGAAG TCCTAGAGGG GGTCGGGGTT TGGGTGGACA 
AGCTTTCCTC GTCCTCTCCC GACAGAGCTG ACGTGTCCTG GGTTCCACCG GGAGCGGGCA 
TTTCCACCGG ACGGGAGGGT TCGGGGTGTC CGGGGCTGGG GAATACGTAG GGGTTGCCGC
GCGGTGTGGG GAGTTGGGGC GTGTGGCTGC AGTCCCGGGA GTTCTTGGAG GGGGTCGGCC 
CACCGAGCTT CCGGACCGGC TGATCTGCCC GTAGCTTGCC GGAlMGGARGG CGGAGCTGAC 
TCTCCGTCCC TTCTCCCATC CCCTCCAGTG GTGGGTACGG GCACCTCGCT GGCGCTCTCC 
TCCCTCCTGT CCCTGCTGCT CTTTGCTGGG ATGCAGATGT ACAGCCGTCA GCTGGCCTCC 
ACCGAGTGGC TCACCATCCA GGGCGGCCTG CTTGGTTCGG GTCTCTTCGT GTTCTCGCTC 
ACTGCCTTCA ATAATCTGGA GAATCTTGTC TTTGGCAAAG GATTCCAAGC AAAGATCTTC
CCTGAGATTC TCCTGTGCCT CCTGTTGGCT CTCTTTGCAT CTGGCCTCAT CCACCGAGTC 
TGTGTCACCA CCTGCTTCAT CTTCTCCATG GTTGGTCTGT ACTACATCAA CAAGATCTCC 
TCCACCCTGT ACCAGGCAGC AGCTCCAGTC CTCACACCAG OCAAGGTCAC AGGCAAGAGC 
AAGAAGAGAA ACTGACCCTG AATGTTCAAT AAAGTTGATT CTTTGTAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAA 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 932 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

TCATCATATA CAAAGTTTTT CGTCACACTG CAGGGTTGAA ACCAGAAGTT AGTTGCTTTG 60 

AGAACATAAG GTCTTGTGCA AGAGGAGCCC TCGCTCTTCT GTTCCTTCTC GGCACCACCT 120
GGATCTTTGG GGTTCTCCAT GTTGTGCACG CATCAGTGGT TACAGCTTAC CTCTTCACAG 180

TCAGCAATGC TTTCCAGGGG ATGTTCATTT TTTTATTCCT GTGTGTTTTA TCTAGAAAGA 240

TTCAAGAAGA ATATTACAGA TTGTTCAAAA ATGTCCCCTG TTGTTTTGGA TGTTTAAGGT 300

AAACATAGAG AATGGTGGAT AATTACAACT GCACAAAAAT AAAAATTCCA AGCTGTGGAT 360

GACCAATGTA TAAAAATGAC TCATCAAATT ATCCAATTAT TAACTACTAG ACAAAAAGTA 420

TTTTAAATCA GTTTTTCTGT TTATGCTATA GGAACTGTAG ATAATAAGGT AAAATTATGT 480

ATCATATAGA TATACTATGT TTTTCTATGT GAAATAGTTC TGTCAAAAAT AGTATTGCAG 540

ATATTTGGAA AGTAATTGGT TTCTCAGGAG TGATATCACT GCACCCAAGG AAAGATTTTC 600

TTTCTAACAC GAGAAGTATA TGAATGTCCT GAAGGAAACC ACTGGCTTGA TATTTCTGTG 660

ACTCGTGTTG CCTTTGAAAC TAGTCCCCTA CCACCTCGGT AATGAGCTCC ATTACAGAAA 720

GTGGAACATA AGAGAATGAA GGGGCAGAAT ATCAAACAGT GAAAAGGGAA TGATAAGATG 780

TATTTTGAAT GAACTGTTTT TTCTGTAGAC TAGCTGAGAA ATTGTTGACA TAAAATAAAG 840

AATTGAAGAA ACACATTTTA CCATTTAAAA AAAAAAAXAA ACTNGAGGGG GGCCCGGTAC 900
CCAAATCGCC GCATAGTGAT CGTAAACAAT CT 932



(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 996 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:

CGCCTGGCAC CATGAGGACG CCTGGGCCTC TGCCTGTGCT GCTGCTGCTC CTGGCGGGAG 60

CCCCCGCCGC GCGGCCC&CT CCCCCGACCT GCTACTCCCG CATGCGGGCC CTGAGCCAGG 120

AGATCACCCG CGACTTCAAC CTCCTGCAGG TCTCGGAGCC CTCGGAGCCA TGTGTGAGAT 180

ACCTGCCCAG GCTGTACCTG GACATACACA ATTACTGTGT GCTGGAOAG CTGCGGGACT 240

TTGTGGCCTC GCCCCCGTGT TGGAAAGTGG CCCAGGTAGA TTCCTTGAAG GACAAAGCAC 300

GGAAGCTGTA CACCATCATG AACTCGTTCT GCAGGAGAGA TTTGGTATTC CTGTTGGATG 360

ACTGCAATCC CTTGGAATAC CCAATCCCAG TGACTACGGT GCTGCCAGAT CGTCAGCGCT 420

AAGGGAACTG AGACCAGAGA AAGAACCCAA GAGAACTAAA GTTATGTCAG CTACCCAGAC 480

TTAATGGGCC AGAGCCATGA CCCTCACAGG TCTTGTGTTA GTTGTATCTG AAACTGTTAT 540

GTATCTCTCT ACCTTCTGGA AAACAGGGCT GGTATTCCTA CCCMGGAACC TCCTTTGAGC 600

A^AGAGi^^G C^^CC^TGC 7, TCTCATTCCC TTGACTCATG TCTTGCCAGG ATGGTTAGAT 660

ACACAGCATG TTGATTTGGT CACCTAAAAA GAAGAAAAGG ACTAACAAGC TTCACTTTTA 720

TGAACAACTA TTTTGAGAAC ATGCACAATA GTATGTTTTT ATTACTGGTT TAATGGAGTA 780

ATGGTACTTT TATTCTTTCT TGATAGAAAG CTGCTTACAT TTAACCAAGC TTCTATTATG 840

CCTTTTTCTA ACACAGACTT TCTTCACTGT CTTTCATTTA AAAAGAAATT AATGCTCTTA 900

AGATATATAT TTTAYGTAGT GCTGACAGGA CCCACTCTTT CATTGAAAGG TGATGAAAAT 960

CAAATAAAGA ATCTCTTCAC ATGARAAAAA AAAAAA 996

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 785 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:


GGCACGAGGG GCTTTGCGTA CACAATAGCT GCTAGGAGTA CCCAAAGCCT GARTACARCC 60 

TGCTGGTGTC ATGGCCACGT GTGAGCAGGC CAGCGTCAMA CGGCTCGCTG TGACCCGTCC 120 

CGRAGACTGA AATGGGCCTG GGTCTTCTCC TKGTCCTGTG ATWAAAGTCC TCTCTTGAAA 180

GTGGAGAGCA AAGGCACACA GAGGTGCGCG CTCACAAGAA TTCCTCCCGG TGACTGGGTA 240

ATCAATGTTA CTGCTGTTTC CTTTGCAGGA AAGACCACAG CAAGATTCTT TCATTCGTCT 300

CCTCCTAGCC TGGGGGACCA GGCTCGAACT GACCCTGGAC ATCAAAGGAG GGATTATGTG 360

GCTGCTAAAG CCATCGGCCC ACAGCCCTGT TCACRTCTTG GTGCTTCTCT TTCCCAGAGG 420

CTGGTCCCAG CCAGGCACAC ACAAAAGGCA GATTCTCGTA AACSCAGCCT CCCTCCCTGG 480

AGGCTGCCTC CTGCCCTGGA TCTGGAGTGG AGCTGCTCTG AGATTTTGAG TTCTTCTGCA 540

GAGATGATTA AATATATCCA AGAGACATTG GAAAACCTGC TGAACATTTT ACATTGGTCT 600

GCTCAGCACA TGGCTGGATG CGGATATTTC TATAATTCCA GAAAGTCACA CAGCTCCTCT 660

GTATGAGACC AGTGGGCGCC ATTTAAAAGA ACAGGATGAG AATCTAAGAT ATATTATTAA 720

TAAATGTAAT GGATTTTTTT TTTGTAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 780
AAAAA

(2) INFORMATION FOR SEQ ID NO: 74:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1069 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:
TCCTCACCAT TCCCCTAGGN CAGGTCCCTG CAGGTCCCAC ACTTCTCCCA GGTCCCTAAA 
CTTGGGTCGG TCCTTTCCCT GGAGTAGCTG GNTCCTCCAG TCGAGGTCCC TGTTCAGTCG 
GTTCTTAGGC TCCTGCACAT GAAGGTGTGT GCCTGTGGTG TGTGGGCTGC TCTAGGAGCA 
GATACAGGCT GGTATAGAGG ATGCAGAAAG GTAGGGCAGT ATGTTTAAGT , CCAGACTTGG 
CACATGGCTA GGGATACTGC TCACTAGCTG TGGAGGTCCT CAGGAGTGGA GAGAATGAGT 
AGGAGGGCAG AAGCTTCCAT TTTTGTCCTT CCTAAGACCC TGTTATTTGT GTTATTTCCT 
GCCTTTCCGA GTCCTGCAGT GGGCTGCCCT GTACCCTGAA CCTCATGAGC CTCTAAGGGA 
AAGGAGGAAC AATTAGGACG TGGCAATGAG ACCTGGCAGG GCAGARTACA AGCCCAGCAC 
CAGTGTCCCA GCCTTACTGG GTCCTTACCC TGGGCCAAAC AGGGAGGGCT GATACCTCCT 
TGCTCTTCCT AGATGC C C AC CTCCTACAAT CTCAGCCCAC AAGTCCTCTC CACCCTAGGG 
GGCTTGCTGC ATGGCAATAA. CTCATAATCT GATTTGGAGG TTTGCCCTTT ACAGGGGCAG 
ATTTTCTGCT CAGTTCAACA ATGAAATGAA GAGGAACTCC CTCTTTCTAC AGCTCACTTC 
TATCAGAGGC CCAGGTGCCT CAGAGCCACA TTGAGTTGCT TTTTCTGGGA TGAGGAAGTA 
GGGTTAAACT CCCCAGTTTC CTGAGCGAGG CTCCTGACAG GTGCCCTTTG TCAGACCCTA 
CCLAOiGCCTG GATAGGCAGC CACATTGGTC CTCGCCCTTG CTCGGNACTC CGTGGTGGTC 
CTGCCCTTCT CCCTGCATGC CTGTGGGTCT GCTCTGGTGT GTGAAGGTCG GTGGGTTAAC 
TGTGTGCCTA CTGAACCTGG CAAATAAACA TCACCCTGCA AAGC'CAAAAA AAAAAAAAAA 
.AAAAAAAAAA AAAAAAAAAA. AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 831 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:
GGAGATTAGA TCACTGTGGA CCTAAAJ^CAA ACAAACAACT ATAAGGAAAA TGGCATTAGA 
AATGGTCTGG - GGATCAGTTT ATCACTGCAG TTGTTACATC ACCCCATGGT CTAAAATACA 
GAGCTTTAGT CTGTCTCTGT TTGAGTTCAT TTTAC AGGAG GTGAACATGA CACTTCCAGA 
AAACTCTGTC TGGTATGAAA GGTATAAATT TGATATTCGT GTCTTTCACT TGAATGGCCA 
GTTTGTGATG ATGCATGC-AG TAAACACCTC AAAACTTGAA AAACAGCTCC TGAAACTTGA 
GCAGCAAAGT ACTGGARGCT GAG TG ATGCG CTCATGATTT TCCACCCTCT CTTCCCATAA 
AGCATCTTCC TAAGGAAATG AMCATGGCCT GATACTCATT TTGTGACTTG TACAGAGGCC 
TAAGGATGTT CTGAATTCAG TGGTGCCAAA TAAATGTTGA GATTCCCCTT TTGGTTGATG 
GAAGTATCAG TGTGGGAACT GTTTGCTTAA TGGCATTTTA TAAAATAAKA AKAXCATATT 
AGGAGGGAGG GAGATGATGG AGGGAGGGAG AAGTCGATTT GTCTTATTTA TCCTTTTTGT 
ATTAATAGAG AAGCACTTCA CAGTCACTGG CAATGCCATT TATAGGAAGA AGGTTCTGCA 
TTCGTGCTGG TCCCGGAGGG CTTAACTTTT TAATGAAAGA ATAAATGGTC TTCCACTCAG 
TAGATAAAGT GAAATGTGAA TTGTTAATAA CTGTGCAGGG TGAATAAAGG GATGTTTTAA 
.GGAATACAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACTCG A 

(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 590 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:

TATATATAGA CNGTTAATAG TCGTGANTGN- TGTGNACGAA CATTAACGGA AGTAGCATGT 

AGCCAGTCGA ATAACMTATA AGGACAAAGT GGAGTCCACG CGTGCGGCCG TCTAGACTAG 

TGGATCCCCC GGCTGCAGGA TTCGGCACGA GCTGGCAGGT GAGGAGCAGA GAGACTGTTC 

CCTTGGGTGG AGAGGTGTGG GCATGAGAGC CACCCATTGC CAAGCAGCAA. GAATGTTCGT 

GCTTTTTTCC CTTCCAAAAT , ATGCAGGGCT CAGGCTCCCA ATTCCGGGCC TGTCTGCTTT 

GCTTGTGTTT CTCCTGTCCC TGTTCTCCCG GAGGGCCCAG GTGGAACTCA CGACAGGGAG 

GGAGACGCTT CCCAAAAACC TGCAGGGCTA TTTCCCAGAA TTTGGTTTTC AAGTACAAAA. 
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CTTTTTGTCC TGTAAGATAT ATGCAGCCTC ACAGAAGCAG CCTCTGCCTG CACTTTACCA 480 
GCTACGTTTT TATGTTAAGC ACATGGGGCT CCCTTAGAAC TTACTCCAGT GATTTAAAAA 540
AAAAAAAAAA AAACTGGAGG GGGGGCCCGG TACCCATTCG CGGTAAAAGT 590



(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1274 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

20 GAGCCACCAC ACCTGGCCTG GAAGGAACCT CTTAAAATCA GTTTACGTCT , TGTATTTTGT' 60 

T.CTGTGATGG AGGACACTGG AGAGAGTTGC TATTCCAGTC AATCATGTCG AGTCACTGGA 120 

CTCTGAAAAT CCTATTGGTT CCTTTATTTT ATTTGAGTTT AGAGTTCCCT TCTGGGTTTG 130 


TATTATGTCT GGCAAATGAC CTGGGTTATC ACTTTTCCTC CAGGGTTAGA TCATAGATCT 240 

TGGAAACTCC TTAGAGAGCA TTTTGCTCCT ACCAAGGATC AGATACTGGA GCCCCACATA 300 

30 ATAGATTTCA TTTCACTCTA GCCTACATAG AGCTTTCTGT TGCTGTCTCT TGCCATGCAC 36 Q 

TTGTGCGGTG ATTACACACT TGACAGTACC AGGAGACAAA TGACTTACAG ATCCCCCGAC 420 

ATGCCT CTTC CCCTTGGCAA GOTCAGTTGC CCTGATAGTA GCATGTTTCT GTTTCTGATG • 430 


TACCTTTTTT CTCTTCTTCT TTGCATCAGC C^ATTCCCAG AATTTCCCCA GGCAATTTGT 540 

AGAGGACCTT TTTGGGGTCC TATATGAGCC ATGTCCTCAA AGCTTTTAAA CCTCCTTGCT 500 

40 CTCCTACAAT ATTCAGTACA TGACCACTGT CATCCTAGAA GGCTTCTGAA AAGAGGGGCA 660 

' AGAGCCACTC TGCGCCACAA AGGTTGGGGT CCATCTTCTC TCCGAGGTTG TGAAAGTTTT 720 

CAAATTGTAC TAATAGGSTG GGGCCCTGAC TTGGCTGTGG GCTTTGGGAG GGGTAAGCTG 730 


CTTTCTAGAT CTCTCCCAGT GAGGCATGGA GGTGTTTCTG AATTTTGTCT ACCTCACAGG 840 

GATGTTGTGA GGCTTGAAAA GGTCAAAAAA TGATGGCCCC TTGAGCTCTT TGTAAGAAAG. 900 . 

50 GTAGATGAAA TATCGGATGT AATCTGAAAA AAAGATAAAA TGTGACTTCC CCTGCTCTGT 960 

GCAGCAGTCG GGCTGGATGC TCTGTGGCCT TTCTTGGGTC CTGATGCCAC CCCACAGCTC 1020 

CCAGGAACCT TGAAGCCAAT CTGGGGGACT TTCAGATGTT TGACAAAGAG GT AC CAGGCA 1030 


AACTTCCTGC TACACATGCC CTGAATGAAT TGCTAAATTT CAAAGG A AAT ' GGACCCTGCT 1140 

TTTAAGGATG TACAAAAGTA TGTCTGCATC GATGTCTGTA CTGTAAATTT CTAATTTATC 12Q0 

ACTGTACAAA GAAAACCCCT TGCTATTTAA TTTTGTATTA AAGGAAAATA AAGTTTTGTT 1260

TGTTAAAAAA AAAA 1274



(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1133 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:



AGGATTTTTC CTTGTTCAAC CAAAATCTGA . GCATTCTTTC TATGTTGAAA ACACTGAAAA 60 
ACTA ATTTWA GTTAATGAAC TAGAAAGAAT ATTGATTTTW. AAGAAACAGA AAAATACTAC - 120 

TTATTTTCCT TCTCAAATAA OGTTTCTTTC AAAAACTTCT GGCTGAAGTA TAACATGCTG 130 

GTAGTTAACA TAAATCTTGT CTTTCTCTTG TTCTTTATCT TTCTTTGTTA TTTAGATGCT 240 

TGTATAAATG TCTTTTGTTT TTATTAAGTG CCTAATTGAC AGAGCTTAAT TTGAAGAAGT 300 

GCCCTAATTT ATTGACCACT TAAGAATTGC - CTTT ATTGGG GTATTTTATT TGTTCCTGCG 360 

TCTTTTTGAT GTTGTTCAGT CTACTCATCC CTGTGAGTAT GTGTGGGGGA CAGCTGATAG 420 

AAGGGAGGAG AGTGTGTCTA TGCTCAGGAT TGCCCTTTAG OCACTCAGCC AGAGATCCAC 430 

AGGGAGCAAC AAGGACAGTT TCACATGCTT AGACTTTCTT GGAAGAAACA GTGAGGAGGA 540 

GTAAGTCGTG AGTAGTGTCA AGCTGGATGT AGAATTGTCC TAAGGCAGTT GACCCCACCT 600 

TCCAACATGT TTTCACTTTA TTTGCCCCTC CCTACATTTG GGTTAGGTTC CATTTGGATT 660 

TGCAGCAATA ATGACTTTAT TTCTCTCTTG GTCAGGATTT GGCAGAXAAA ATCCTTTTAT 720 

TATAGAACTA GCTATTTTAG TTACATAGTA ATGTAACTAA TGGAGAGATT TATAGAGAAT 730 

TTTGKTTTTG CTGTCATATA TGTCCA.TTTT GGAGACAGAT ATGATAGAAC TAGAAATTAA 340 

GTTGCATTTC TGCAAGTGCC ATTTGAATGA ACTTCAAGTA TCTTCTTAAT TATTAAATTT 900 

TCTGA.TGAAG GCATTGTAAC AAATATATAG TATTA.TTAAA TCTAATTAAT ATTTGGAAAT 960 
ATTAATAAAT AGGTATTTTA TTTACTGTAA AAAGTCAAAC TTCATTATGT AGATAAATCT .• 1020 

TATTCTTTTC ATTCTTTCCC CTGTTTACAT CCTTTTTACA AAGCTTAGTC ACCAA.TTAAA 1080 

GCTTTCCTAT CAAAAAAAAA AAAAAAAAAA ACTOGAGACT AGTTCTCTCT CCT 1133 



(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 661 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:
. GAATTCGGCA. CGAGGGGAAA AGGATGCTGA ACGAGAGCAG AAAGCCTCTT TCCTTTGCTT 60 
• 10 CACGCCTTTC 'CAGTCTTTAT TTTAAACTCG GGTTCOCTTT CTGTGGTCGC AGCAACCTTT 120 
ACTCCACCTG CACTGCTGCT CCTGGGGGCT CCCCAGGCCT CCCTCTGCCT TTCTACCCAG 130 
TGGCTGACGG GATGCCTGTC TTGCCTGGAC GCACCACTGC TCTCCTGTCC CTCACCTTGG 240 


' CTTTTGCTGT GCCCTGCTCT GGGGTTGAAG CTGGCCCATG TCTCCCCCGG AGTCATGGCT 3 00 

GCTCCTCCTG GGAGGC CTCT GTGTGCGTCA CGTCTTCCAC ACCTGGGGGC AGCTGGCGAG . 360 
20 CCCGTGCTCT GTTCCCCTCG GCTGCTTGGC ACAGAGYTGC AGCCTGGGAY TCTCCGTGGA 420 
CCCAGACTGG GGATTTTGCC AGGGGGGCGA TGGGAGGAGC AGGTGCTTTG CCTGGCGGCT 430 
GTGTCTGCAT TTCTGGACGC CCCAGAGCAC AGAAGTTGCC GGCACTTTGA GGTCTTCCTC 5 40 


GGCATGTGCC AGATTACATG AGTGACGGCT GGGAATATGT TTTCTTTTTT GTAATGGAGG 600 
CGTGTTTCAG ATATAGTAAA GCTCACCAAA AAGTAAAAAA AAAAAAAAAA AAAAAACTCG 660
A 661

(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1378 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

45 ATTGGGT AC C GGGCCCCCCC TCGAAGTTTT TTTTTTTTTT TTTTAATGAA AGCTCTCAAA 60 ' 

TAAGCGATTT TATTCCTATC CATGATTGCA GACATTTACA AAACCA'TAAC ATCTGAGTTC 120 
ACCTTAAAAA ATAACTTATA TAAAGCAGTG ATATACACAG C\CAAAATAG TTCAGGGAGG '"' ' 130 


GGGCAGGAGC AACTTGTAAT AATTAAAATG TAAACGTG^A AAAAAGGATG GAATAAAAGT ' 240 

CCCTACTTAT TTCTACTTAA GATGTCATGT GATAATATTT TACAATGTCC TGTGGGTCAA 300 

55 TGTATGTATG TGTATATGTC TGTATAACAT ACACATATAC AGTACATTCT CTTTCCCACA 2 60 

CATATACATA CACACATAAT TATTTGCAGT' TCAGTTTAGG GCAATTCTAA TATGCCACTC 420 

CGTACAGTTG TTTGAATCAC ATTTGGACCC GCTTTCTTCA CAAAAGAGGG GAGAGAGCAG 430' 

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GAJkATAJ^-AA GGTTGGTTTG GTGTGACTGA GATTCCTTTG TTTAACTGTA G\CTGTGATG 54 Q 

AATAATTTTC TTCCGTAGTA GTTCTGTGAA GGGCTGACTC ACTGTGGTTT TCATGAGGAG " 600 

ACTTGGTAAT GGATCACACG CTCATTCTCA TGCTAGGGGA GTAACTCTCA CTCTGAAAAG 660 
GATTTAAGAA ATTTCCCCCC ATTTCGCCAT CATCCCTTGG AGTGCCCGGT TGATTACTCA ' 720 

GGCTCATATT ATTGGGAGAA. TTCTTGGAAA TACTGTCCAT ATCTCCTGAG CGTAAAGAGC 780 

CATTCATGTC- ATGTGACTCC ATTCCTCCTA ATCCACCCAT GGGACCATCT GACCCAGGRC 340 

CCATTGGAAA ATTAGGTCTG TTAGGTCCAG GAGC-TACTGC ATTCATTAAA GTATACATGT 900 

TATCACCAGA GTTGGTTGAA TCTGCTGGAC TAGGCATGAT GGGTGTTCCT GGTGGCCCTC 960 

CACCTCCTGG AGCAXZCTACA TAATTCCCAG GAGATGCTGA GGAGTATGGT ATTGAATTGG LQ20 

CATTTGTTGG GTTTGGCCAA GGTCTACCA.C CACCTGGACC CATGTTCATT CCAGGCATTC 1080 

CAGGGCCAGG TAAAGCATTC AGTGGGGGTC TCATTGCACC TCCATAGTTC TGTGGTCCTA 1140 

AGGGCACCAT TCCTCTTGGA GGAGTCA.TTC TCTGCATTGG CGCACGCATA TTTGGATGTC 1200 

CTTGTTGTCG AGTTGGATCC ATTCCACTGG GGAGTAATGG CTGACTTCCT GGGACACCTC 1250 

CAAGTGCCTG ATTAGGTATC CTCAATGGGG ; .GCCTTGGACC TCCAGGGTAC CGAGGTGACA 1320 

TAAAAGGGTA ATCATGGAAG GCTTTTGCTT CACTTGAGTG TTCACATGTT TCACGTCT 1378

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1440 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:

ACTTTGTCCA AATGTGTCTG TCACATGTAG TCAGCTGMAG NAATTTAAAA TGAATTGCCA 50 

AGTGAAGAGT CTGTGGATTA ATTGGCCGTT AATTAACAGG CTTTATCAAT GTGTCCTCAA 120 

GGGAGAGGCC CAACCCIAAT TAAGGAGCTA AACTTCCTGA GTGAGGGGCT GTGAGGATGG 130 

AGGTGGAGGA GGCATCTGGG GCGGGTGGTG GCCGGGCCK CAGATGGCGC CTCCCTGGCT 240 

GAGCTGCCCG CACCGCCAGT TCCCTCATTT CCACTCAGGA AGGCAGAGAA GGCAGAGTGA 3G0 

TCTCCTCAAG GAAGAGCTTC CCCAGCCTTC GGGAGCAGCT GGCAjGGGCGT CCGGGAATAA 360 

- GCCCTACACG CCGCCGCCTG CCTCCAACTC ACTAACCCTG CGC'CTCTTGT CTTTCAGATT 420 

CAACGCGTTC AACAGAAGCC ATCCCCAGCC CAGCTTAAAT TATAAAGATA GACAATAACT ' 430 

CTGTTCCAAT CTGCGTGGTG CTTCTTTAGT AAATACTGTA CAGATTTTAC CATGGAGAAC 540 
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TTTTTTTTTA GTTTTT AC CT TTTCTTAATT ACCCTTATTC CGAATGGACG AACACTTTCT 500 

£CCACTGCTG ACCATTGTAA AATAC CGTGT AT ATAAATC C CATTGAAATA ATGCCGTGGA 560 


ATAGAACATC TCAAATGCTG CTTAATTACA GACTCAGGTC GATTACTTGT ATTTCATGTA 720 

ATGTTCCTCC AAGTTAGACA TCTGGTGCAA ' GACCAACCGG GAGACCATGG AATTGTCAAA 730 

10 ACT ACAA ACT GACAGTGTGT ATATTTAATT TAAAGACTTA TTTAAAAACT CACAAGCTCT 340 

CACCTAGACT TTGGAGAGCA. GTCTGTTTTC TGTAATGTCT GATACTAGAA ACTAATTTGC 900 

TT-TT^TAG' 7, T GTATTCAAG ATTTGAAGAT GTATTTTATA GACAAGTTCT GTTTTTGAAC 960 


TTTGTGGAAC TGTTCCAATC AATCAATTTC CCAGTTATGA TGAGTATTTA CATTATGAAT 1020 

GTATAACCCA GACATGATTT GTAAAGCCGA CAGTATGTTT CTATTACACA AGACTTTTTG 1030 

20 ATACAGCGTC TCTTGTCTTC ACTGATACTG GAGTCTCCGT TCTCTGCNNG GTCCCTTCGA 1140 

GTTTCTAGTT ACAGACACAA TCATACTGTG ATTTTATTTT TAATATGGAT ATGCTATCAA 1200 

ACTGTG A T AC ACTTATAATT CACTGGTCCT GCATCAGGAG ATGGAGTGGG GAAAACTGTA 1250 


TTT AATAC AG TTTGTATCTG AATAATCTGT ATGGTTTATA CAGTTTGTGT TGTTCAGAGA 1320 

TGTTT AAAGT TTGATCTTTG TTTTTCTAAA GATTAAAAAA GCACTTGCCC C*CTGTAAAT 13 80 

ATACAGCATG TAAAATTTCT RTAGTATATA AATGGCAGCA AATGACAAAA AAAAAAAAAN 1440



(2) INFORMATION FOR SEQ ID NO: 82:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1381 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

45 CCCGGGCTGC AGGAATTCGX YACGAGGCCA GCAGTTGCTC CCAGTT CAGG AGGTGCTCCT 50 

GTACCCTGGC CAOGCCCAA TCCTGCCACT GCTGACATCT GGGGAGACTT TACCAAATCT 120 

ACAGGATCAA CTTCCAGCCA GACCCAGCCA GGCACAGGCT GGGTCCAGTT CTGACCTGAG 130 


CACGGTTTTT CCTCATGTGA CTTCTGGGAA GGCGCTCCCT CATCTGGGCC AAAGGAAGGA 240 

GGACGAAGCC CTCCTCAC-CT GGCCTGTGTT. TGGGGCATGA ATCTCTCCTC TCCTCCTTGT 300 

55 'CTGGCTCTGT TGACAAACCG GGCATGTTTG GCAGTAAATT GGCAC CGTGT CACACTGTTT 360 

CCTGGGATTC AAGTATGCAA CCAGAACACA GGAGAAGAAA AGCTCCAGGA TCCCTGTCCC 420 

CATC^GTC CTTGATGTGA GAG AO ACT CT GAGACTTCTT CCATCGCAAT GAC CTGTATT ■ 480 

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a^CAOAGC CCCCCAAGCA AAAGAAGAGG TTGAGTTTOC TGCCAGGATT CAGATCAGCC 540 

CTTCCCAGGG TCTGCAGGTG TCACATGATC ACAGTTCAGC GGGAGGCTTT CCGTACCCAC 600 

ACTGGCTGTA GCACTTCAGT CCATCTGCCC TCCAGAGGAG GGTTTCTTCC TGATTTTTAG 660 

CAGGTTTAGA GGCTGCAGCT TGAGCTACAA TCAGGAGGGA A^TTGGAAGG ATTAGCAGCT 720 
TTTAAAAATG TTTAAATATT TTGCTTTGCT AATGTGCTGA TCCGCACTAA CTCATCTTTG • 730 

CAAAAGGAAC TGCTCCCTCG GCGTGCCCCA GCTGGGGCCT CTGAAGGGAT TCCTCACTGT 340 

' GGGGAGCTGC CCTGAGCTTC AGGCAGCAGT GTTCATCTCT GGGCAGTTGT GTGGTTTCCA 900 

TGTATTCTAG GCCAGGTAGG CAACACAGAG CCAAGGCGGG TGCTGGAAGC CAGACGGAAC 960 

AGTGTTGGGG CAGGAAGGTG GATGCTGTTG TCATGGAGCT GTGGGAGTTG GCACTCTGTC - 1Q20 

TGCTGGTGGC CCTCTCGGCT CACATGTTCA CAGTGCAGCT CCTGGCAGAC TTGGGTTTTC 1030 

TCTTTGGTGG TTTCTAAAGT . GCGTTATCTG OAACAAGTT CTTTTCTCCT TCAGGAACTG 1140 ' 

TGAATGGGTA GAAGAAGGAG CTCAGTAAAC TAGAAGTCCA GGGTTGCTTG GTTTACTGGT 1200 

TTATAAGAAA. TCTGAAAGGA CCTCTGACAT TCCTTTTATT AACTCACCTC' TCAGTTGAAA 1260 

GATTTCTTCT TTGAAAGGTC AAGAGGGTG.-^ ACTGAAAAAA GTGTTGGCCT TTTTGCGGGA 1320

CCAGATTTTT AAGATAAAAT AAATATTTTT ACTTCTGTCA AAAAAAAAAA AAAAAAATNT 1380

G 1381



(2) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1706 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:



ACTGCACCAC TGCCCAGGTC TCCCGGCTGG ATGAAGACGT GGTCCATGAG GAAGCTGGCT 60 

AGCTCAGACT GGAGAGTAGC TTCAGGAAAA AAGACAAGTG GCCTAAGGAA ATCACGGCCC 120 

CCAACTATCA TCTGAGGGCT AAAGATGAGA AGTAGATCAC TTAATAAGAC AAAAGCCTGT 130 

AGGGGGAAAA GAAAGGATGT TTAAAAGGAC AGAATGTTTC CC AA3GTAGA AATGACACTG 240 

TCAATTTCTC CTTGGAATGG GGGCAGGGAT ACTCGCCTTG TTGCTCCCAC TTGAGT CAGT 300 

ACTCACCTGC TCCTGGATCT CAGTATCCAC ATCTGAGAGG CAACTCTGGC AGAGTTCAOi 360 

GAAGGCCACC ATTCTGTCCC TC-AACTCGA CAGCTGCTTC TGTGGGCACA GTGGCTTGAA 420 

GGGGAAGAAT GAAGACACAG ACTCCTCTGT TCOCATTATC CCATCTAAGA CCCACACTCA 4B0 
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CCTGGGGAAG CATCTGATTT. AGAAATGTGG GTTAGTGTCC AGAGAATGGA AAAATAGACA ' 540 

AGAGTCAAGG CTGGGAGGAT AACCTGTAAC AACAAAGGGT TTGAAAAATG AGGTTTGGGT 60 Q 

TAGGAGAGGG AGAGACAGAT AGCCAGAAAC ACACCAGTGA AGAGGAGAGA AAATGAGTAA 660 
AGGGAGAGGT AATTCCTTTT. CCAGTGGAAA ATGAGTGATA TTCTGGACAT TCTTCA.GAGG . 720 

CATCTACACG AAGTAGAAAT GTCaCCGCTC CCTAATtTAC TCTACGTCTT GTAGAATCCC 73 Q 

TCAATATTAT CCTTGGCTTC CAGGAAATCC AAGAAGACCC TGGAAGTAGA GTCCACCTTC 340 

TAAGAGAGGA ATGTAAGAGG TGACCCCCAC CCACCTGATC TTCCTCGCTT TGTCCACTCC 900 

ACGCACTGAG ACTTGACACA CCTAGTGGCC ACCTAGAACG TAGGTCCTTA AAATYTAGCC 960 

CCCCAGCCCC CAACCCATCT CTACGCTGTC CACTCACCTG GTGAGGAACY TYTCCTGTGT 102 0 

CCACAGCYTT CTGCAGGAGT TGGCAACATG GCTCATAGA3 CTCCCAGCGA GTCAGGTCAT 1080 

GAGTGCTTTG GGGGAGAAAG GGGAATGTTA TACTGGAAAA GAACAGAGGG AACCAACTCC 1140 

ACAGACACGA GTAAAAACGG GATGGGGAAG AGGAGGAAAG CGACTCACTT GTAGAAGGCA 1200 

GAGAGGCGTT TCAGAGTGGC TGCCAGATTA t TATACCTCAT CCTCATCTAG GAAGGACGAC 1250 

TGAGAAGGAA" AGAAGATCCA CAATAGCATT TCCCCCAGAAJT^CATCAGTC CACATCCCCC 1320 

GTCTTGCAGC CCCTCCCACC CTTGTTTGGG GTGTCCCATT GTCCAGCCCC AGCTCCTACC 1380 

TGTAACAGCT CTTCAAGCTC CTGGTGGAAP. CGGTCAGTCA GCAAATCTAC TAGCTGGCTG 1440 

CGGGCAAAGT CCGGGGGGGT GAAGAAAGTG AATTCGGGAT TACAGAGCAG GTAAGAGCAT 15QQ 

GCGC CCCAGC CTCAAGCACC GCTGGCTCTG CATGCTTCAC CACCACCTCC TGGAGTTGGT 1560 

GCAGGAACAG CTCCAGGTGC TGAGAAGAAA AGGCAGAAGA TGGTGTGCTG TGGGGATGGG 1620 

AGGAGGACAG TCTTGTGGGG GGAAGTGGAA CGGGGTTAAA ■ AGGATT AAAG ■ TTCAAGGATA 1680 

AGATGCCTAA RAAAAAAAAA AAAAAA 1706 

(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 573 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

GAATTCGGGA CGAGCTTGGT AGC CTTAGAA CTGCATGAGG TGCTTTACCA CTGGGAAACA 60 

CGAGCACAGC CTAGCTTGAT TTTGTATGTG GTATCAGATC TAAGGTGGAT GGAATTCAGG 1-20 
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ACTTCCTGT C TACTCTTTGA TTTTGTTTTA TTTTTAGAAA TGTTTTATTT TGTTTTATTC 130 

ATTTATTCA.T CTTCAGAGAC ATGGTCTGGC TCTGTTGCCC AGGATGGAGT GCATGGTGTG 240 

A.TCATAGGCC ACTGCAGTGT TGAGCTCCCG GGCTCAGGCG ATCCTCCTGC CTCAGCTYCC 3Q0 

TTAGTAGCTG GGACTATAGG CACATGCCCT ACCATGGCTG GCTTTGTCTA. CTTTTTGAAT 360 
GATGTCYCAA ACTAGAAGGT CTATTAATTT AAAAAATTAA GGATAGCATG CCATAATTAA • 420 

AAATAATAAC AGTGGGAAAA GGCACCTTCC AATGATTCAG ACATCAACTT GTGATTTAAA 430 

AAAACGAAAA ATAAATAATA GGAAAAAAAG GGGAAAAAGT TAAATAAAAA TAAAATTAAA 540

AAAAAAAAAA AAAAACTCGA GGGGGGCCCG GTA 573



(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 684 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:



CTCTTTGGCT GTGTCTACCT CCTTCATCTG CTGCGCCGAC ATAAGCACCG CCCTGCCCCT 60 

AGGCTCCAGC CGTCCCGCAC CAGCCCCCAG GCACCGAGAG CACGAGCATG GGCA.CCAAGC 120 

CAGGCCTCCC AGGCTGCTCT YCACGTCCCT TA.TGCCACTA TCAACACCAG CTGCYGCCCA 130 

GCTACTTTGG ACACAGCTCA CCCCCATGGG GGGCCGTCCT GGTGGGCGTC ACTCCCCACC - 240 

CACGCTGCAC ACCGGCCCCA GGGCCCTGCC GCCTGGGCCT CCACACCCA.T CCCTGCACGT 300 

GGCAGCTTTG TCTCTGTTGA GAA.TGGACTC TACGCTCAGG CAGGGGAGAR GCCTCCTCAC 3 60 

ACTGGTCCCG GCCTCACTCT TTTCCCTGAC CCTCGGGGGC. CCAGGGCCAT GGAAGGACCC 420 

TTAGGAGTTC GATGAGAGAG ACCATGAGGC CACTGGGCTT TCCCCCTCCC AGGCCTCCTG 430 

GGTGTCATCC CCTTACTTTA ATTCTTGGGC CTCCAATAAG TGTCCCATAG GTGTCTGGCC . . 540 

AGGCCCACCT GCTGCGGATG TGGTCTGTGT GCGTGTGTGG GCACAGGTGT GAGTGTGTGA 600 

GTGACAGTTA CCCCATTTCA GTCATTTCCT GCTGCAACTA AGTCAGCAAC ACAGTTTCTC 660

TGAAAAAAAA AAAAAAAAAA AAAC 684



(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1036 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:

TGGAGGCAGA TGCACAGGAG AAAGGTTCCC GTCCGCACCC TCTCAGACCT GAGGCTGAGC 60 

TTGCAGTGAG GGCTTCTCCT CGGCCCCTCG CCCGCCCCCA- GAGCTGCCAT CCCTGCTGTT 120 

ACAAGCCAGA GGAGCCCGGA TGTGAGGCCC CAGATCACCT CCAGGGACTT GGGGTTCCCA 130 

TCTGAAATCC TTTATTTTTG TACCATGGGG TGGGCCCCGG GCTGAGAAGG AAGAAGCACC 240 

CTCTCCCCGG GCTCGTCTGT CTGCACCCGT GGGGCTGTGA CTTACTCCTG CCTCGAGGGG . 300 

CGGGGGGGGG CCCCCTGGGA CCTCTTAAGG CCCAAGGTGG GCCCCKGAC CTYTGGGCAG 360 

AGTGGAYTGC. TCATGGCAGA TGTGTGGCAA TGTCTGGCTG WSTCTTTCCG GCAMCTGCGT 420 

YCCCTYTCCC GGGOTCCCCT GCTGCATGGT GGATGTGCTC CTTCCTGGCC CGGTCACATT 43 Q 

GCCTCCTTGA GCCTTAGTCC AGGGGGTCAC TYCTCCCACC CCACCTACCT CACAGGGTTG 540 
TTGTGAGGGT GCACAGAGGA GCAAAGTCCC TGAAGGCCCT CAGGCAGTAT ATAGGGGCCG . 600 

CCCACCTTCA GCTGCCCTGG GATGGGAAGd'' ACCCAGCCCG ACC CCTGGGC ATAACACTGT 660 

GTTTGCAAAT GGAGATTCAG GTATTGGGGA TGCAGGTTGT GGGGAGCTGG CCTGGCAGAG ■ 720 

TAGGGGTAGT TGGCTTGGCC TTGTCTTTGG TGATCCCACC CCCAGCCATT TGCATTGCTG 730 

GCCCAGCGCC TGGGGTGGGG GGCGGGGAGA GGCAGCAGAA GGGGCTGGGC AGGGGCGGTG 340 

GAGGACTCAG GAACTGCCCG GGGAGAGTGG GTATGGGGGC TGAGCCAGGG GCCCTCCTGT 900 
GTTTGACTTC CCGGGATGGG TCCTTGCTTC TCAGCTGTGT CCGACCCCAC CATGT AAT AA. . 960 

AACCCAAAGG A^CAGCAAAA AAAAA&AAAA "AAAAAPAAAA AAAAAAAAAA AAAAAAAAAM 1020 

CCCNGGGGGG GMCCCG 1036



(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 908 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:
TTAAACAAAT GGAATCATGC AATATGTGAC CTTTTGCGTC TGGCTTATTT TATTTAGCAT 60 

AATGTTTTTG _ AGGTTCATCC AAGCTGTAGC ATGTATCAGC ACCTCATTTC TTTTTCTGGC 120 


TGAATATTAT TCCATTATAT GGATTTACCA C^ATTCATTT ACCTATTCAT CTTTTGTTTC 130 
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TGCTGTCTGG CTATTGTGAA TAATGCTTCG ATAAACATTC ATATAGAAGT TTCTATGTGG 240 

CTTTATGTTT TCATTTCTCT TGGCTATCTA CATGGGAGTA GAATTCTAGG TC ATAATATA 300 

ATTTTATGTT TAACTTCTCA AAGAATTGCC AAAAGGTTTT TCATAGTGGC TGCATCATTT 360 

ACATTCCCAC CGGCAATGTA CA&GGATTTC TATTTTTCCA TATCCTTGCA CTTACCAACA 420 

CTTCTTTTTK GTWATWATTT TGTTTTTTCA TTATTGCCAC CCTAGTGGAT GTGAAATGGC 430 

ATCTTATTGT TTTGATTTGC ATTTCTCTAA TGACAAATGA TATCATACTT TTTTTATGTG 540 

CTTACGGATC AAAGGTATTT CCTTGGAGAA ATGTCCCTTC AAGTCCTTTG CCATTTCAAA 600 ' 

ATTTGGTTAT TTGTCTTTTA TTATTCAGTT TTAAGAAATT CTGGCCA3GC GCAGTGGCTC 660 

ACCTGTAATC OTAGCACTTT GGGAGGCCAA GGCGGGCAGA TCACTTGAGK TCAGGACTTC 720 
GAGACCAGCC TGGCCAACAT GGTGAAACCC CATCTTACTA AAAATACAAA AATTAGCTGG . 730 

GCGTGGTGGC AGGTGCATGT AATCMTATCT ACTCAGGAGG CTGAGGCAGG AGAATCGCTT ■ 340 

GAACCCAGGA GGCGGAGGCT GCAGTGAGCC AAGATCACGC CATTGCACTG TAGCCTGGGT 900 

GACACAGA 908



(2) INFORMATION FOR SEQ ID NO: 88:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 655 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:



TGCACTGGTT CCTTCTCCCC AGCAAATACT GCCTTCTTGT TTTTCTCTGA TGTGGCAGGT 60 

GACTACAAAA TCCGCCTTGG TATTCTTCAA ATGCATATAT ATTCCTTTCT TGTGAGCTCC 120 

CTCTCTTCCT AGATTAGAAA ACTGCCTCAT TTTCTGCTCA CTGGATGTGC AGTCC'CAGCT 130 

TGTCTTCCTC TCCTCCCCCC CTGTTGCAGG TGTTCTTTTT TTTTTTCTTC TCTCCCCACT - 240 

GGGCAGCAAA AGTTGTTCCA CAGTGGAAAW TTAGGCATCC TCAAGTTTCY TCCCAGCTTC V 300 

TGCTGTGTTT TCTTAGAGTA AATTGCCAAT TTCTGTTTTT ACAGGAAATC CTTTTTTAAA 360 

AATGGAATCA GTGTGGTCCC CATCTACTCT GCAAAAATTG CATTTTTCTC TATTTTCAAA 420 

TGAGATTTGT TCAftGTTTCA AAACCACGTG AAATAATAAA TGTATAGTAG TTTTCTTTTC 430 

CTTGGGCATT GCTWGATATG TGAAATGGGT TTATGAAAAA TAATAAAATC ATAACGCTAT 540 

TTGTTTGACT TTCAATTTCA TGGGAATTTT TCTCAGCTAA ACTCTAAATO GTGATTAFGC 600

AAAAAAAAAA AAAAAAAACT GHAGGGGGGC CCGGTACCAA TTCGCCCTAT AATQA 655

(2) INFORMATION FOR SEQ ID NO: 89:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1102 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:
TTTTTTTTTT ACCATTTAAA ATAAAATGAA AGTGACCTTC TGTTTATAAA AATCTTTGTC 
TGCATCTCTG CTTATTTCCT TAGAAGAGAT TCCAAGAAGC GGTGAGTGAT TTCACGGCAG 
CAGAGGGTTG GGACATATTA CGGGCGCGGA TCCCTCTTGG AGTGAGATGA CTCTCCGGAG 
AGATTTAGTC GTCACCCTCG CGTGTGAGGC TGCGTCACAC CCCAGGGATG TGTCTATCAA 
GATGGAAGAT CTTTTACACG CTCTTGATTT TGTTTG5CTY TTTTTCTATT ACTAGTGAGA 
AXGAAACTTT TTA.TATGATT ATTATCCATC ATAATCCAAC ACAAA.TTACT GCTTCATGTT 
CTTTTACTTT CCTGTGAAGG TTTTAGTGCC TTTTAAAAAT TGCTATATAT TAAGCTTGTT 
AATACTTC C A TGCTGTATTT GTGG3CATCA RTTTCCCCGG GNACAGGCNT C-GACATTTTG 

CCTTCACACG CTGGGTGGTT TTTCATTTTC AOTTCTATTT CTCGTTCTTC TATCGTTTTA 


TGTTCAGACG GGTTTCTCCG TGTAGAAAGC AGTTTATGAA GATTTACTTT CGACAGTCTT 
CTCTCTACTT TCTACAGTGA ATTCTCTGAT GTGTCTGGGA GTTTGGGGGT GTGGGTAAGA 
RTCCTCCTCT CACCCTATTC TCTATTACGA TCCACAGCCT CATGCTTTAT GARATTGGTG 
GCCGGGAKCG GGGGAGATTT' GCGGATCCCC CAAGCCAGAC TTTATCCCCC TATCCCTGCC 
TCTGGATCCC ACGTACAGGC CTGGGAACTC CCTGTGGGTA GGGGCCAATG GTCTCGCACT 
CTCACCTGTA CCCCAGGGCT GGCACAGGAT GGTCAAGGAG AGAGGCTGCC CAAGCGCATC 
CfTCTGGTGT CCCCCTGACA CGCCTCCAAA GTGAGCAGGT AGGTTTCAAC AGCCCCACGT 
TGCAGGTGGG AGATGAAGCT CAGGGTGGAG ACCAGTATCT CACAGTTCTC TTTGCATGGC 
CGGGTACTTG TTAGTCAACT GATGAAGTGA AAATTCTAGC CCCAGAGGCA GGAGAATCCG 
"GAACAAAATT AAACCAGCCA go 

(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1533 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:

GGCACGAGCC GfrJCACGGGCA GCGCCCCATA GCGCCAGGGA CCCCCTGGCA * GCGGGAGCCG 60 

CGGGTCGAGG TTATGGATCC AGCGGGCGGC CCCCGC-GGCG TGCTCCCGCG GCCCTGCCGG 120 

TGMCTGGTGC TGCTGAACCC GCGCGGCGGC AAGGGCAAGG CCTTGCAGCT CTTCCGGAGT 130 

CACGTGCAGC CCCTTTTGGC TGAGGCTGAA AT CTCCTTC A CGCTGATGCT CACTGAGCGG 240 

CGGAACCACG CGCGGGARCT GGTGCGGTCG GAGGAGCTGG GCCGCTGGRA CGCTCTGGTG 300 

GTCATGTYTG GAGACGGGCT GATGCACGAG GTGGTGAACG GGCTTCATGG AGCGGCCTGA 360 

'CTGGGAGACC GCCATCCAGA AGCCCCTGTG TAGCCTCCCA GCAGGCTCTG' GCAACGC3CT . 420 

GGCAGCTTCC TTRAACCATT ATGCTGGCTA TRAGCAGGTC ACCAATGAAG ACCTCCTGAC 430- 

CAACTGCACG CTATTGCTGT GCCGCCGGCT GCTGTCACCC ATGAACCTGC TGTCTCTGCA 540 

CACGGCTTCG GGGCTGCGCC TCTTCTCTGT GCTCAGCCTG GCCTGGGGCT TCATTGCTGA 600 

TGTGGACCTA. GAGAGTGAGA AGTATCGGCG TCTGGGGGAG ATGCGCTTC-. CTCTGGGCAC 65Q 

CTTCCTGCGT CTGGCAGCCC TGCGCACCTA CCGCGGCCGA CTGGCCTACC TCCCTGTAGG 720 

AAGAGTGGGT TCCAAGACAC CTGCCTCCCC CGTTGTGGTC CAGCAGGGCC GGGTAGATGC 730 

ACACCTTGTG CCACTGGAGG AGCCAGTGCC CTCTCACTGG ACAGTGGTGC CCGACGAGGA 340 

CTTTGTGCTA GTCCTGGCAC TGCTGCACTC GCACCTGGGC AGTGAGATGT TTGGTGCACC . 900 

CATGGGCCGG TGTGCAGGTG GCGTCATGCA. TCTGTTCTAC GTGCGGGCGG GAGTGTCTCG 960 

TGCCATGCTG CTGCGCCTCT TCCTGGCCAT GGAGAAGGGC AGGCATATGG AGTATGAATG 1020 

CCCCTACTTG GTATATGTGC CCGTGGTCGC' CTTCCGCTTG GAGCCCAAGG ATGGGAAAGG 1030 

TGTGTTTGCA GTGGATGGGG AATTGATGGT TAGCGAGGCC GTGCAGGGCC AGGTGCACCC 1140 

AAACTACTTC TGGATGGTGA GCGGTTGCGT GGAGCCCCCG CCCAGCTGGA AGCCCCAGCA - 1200 

GATGCCACCG CCAGAAGAGC CCTTATGACG CCTGGGCCGC GGTGTGCCTT AGTGTCTACT 1260 

TGCAGGACCC TTG CTCCTTC CCTAGGGCTG CAGGGCCTGT CCACAGCTCC TGTGGGGGTG .• 1320 

GAGGAGACTC CTCTGGAGAA GGGTGAGAAG GTGGAGGCTA TGCTTTGGGG GGACAGGCCA 1330 

GAATGAAGTC CTGGGTCAGG AGCCCAGCTG GCTGGGCCCA GCTGCCTATG TAAGGCCTTC 1440 

TAGTTTGTTC TGAGACCCCC ACCCCACGAA CCAAATCCAA ATAAAGTGAC ATTCCCAAAA 1500 

AAAAAAAAAA AAAAAAAAAA ANCCCGNGGG GGG 1533

(2) INFORMATION FOR SEQ ID NO: 91:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 575 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:

ATCCTCTGGA ATCTAGGTGG AAGCCACCAA GCCTTCTTCA CACTTGCGTT CTGAGCATCT 

GCAGACTTAA CCCCATGTGG CAATCACCAA GGCTTATGGC TTGTGTCCTC CAGAACTGTG 

GCCAGAGCTG TACCTGGGCC CCTTTGAGCT GAGGCTGAAG CCAGAGTCTG AAC-CTCAGCA 

GGGCAGTARG GCCCTGGGCC TGGCCCCTGA AACCATTCTT TTCTCCTAAG CCTCTGGGCC 

TTTGATGGGA RGGGCTGTCC TCAAGATTTT TGAAATGCCT TTGGAGGGTT TTTGCCTTGT 

CTTGGATATT GGCTTCCTTT TAGTTATGCT CATCTCTCTA GCAAGTGAAT GTTTCACAAC 

CTGCTTGGAT TCTTTCTCTA CCACAGARCC AGGCTGGAAA TTTTACAAAC TTTTACACTC 

TGTTTCCCTT TTAAATATAA ATTTCAATGT TAAGTCACTT CTTTGCTCCC ATATCTGATT 

TAGGTTGCTG GAAGTAGCCA AGTCACCTCT TGAAT GCTTT GCTGCTTAGA AATTTCCTCT 

ACTAGGTAGC CTGGGTCATC ACACTTAAGT TCAAA 

(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 639 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:
TCCTTTCATC TTAAGCACCA CCCGACAGGG CAGGTACTAT TACCATCTCC GTTTGACAGA 
TNAGGAACCT GGCACAGGAA GCATTTAAGT GGATTCCCCA GGATCGCCCC ACTGTCAGGA 
GCAGANTCAG AATGGGCCTC AGGATCAGGC TCCCAATCCT GGCTTCTAAC TGCTGGGCTC 
TGCCCTTCYC TCWCCCCACC TCCCCACTCC AGTGCCTTTG GTCATGCCAC TGCAGCTTTC 
AGGCCAATAC TGGATTAGCC TCTTAGTGTT CTTGTCCCTG G\GCCATTTC CCCAGGCAGC 
AATTCCATGT GCCCTCACTG ATCTAGGTGG CTCTTGTGTC ATTTGTCACA TCCTATTGAA 
TTGTTTATGC ATCTTGTTCA CACTCACAGC ACCCTCCCTC TCACACGTCC TCCTTATAAA 
AATGTCCCTC AGTGTCTGCT ATGAGCCAGG TGCAGACTTA AGTGACAGGG CTGCTACGGG
AAATAAAAAA TTAACAAGGA GCACCTCCCT CTTAATGCAC AGTAACAAAC TATGTTAAGT 540
GTCAGGAAGG AAAGGTTAAG GATGCCAGGA AGGCTTTTAA TAAATAACCT GACTTAGATG 600
GGCAGGTGGT GCTGAP.GATT AAGAACGTGT TCTTCTCGA 639

(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 744 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:

GAATTCGGCA CGAGAGTGGC TGGAGTCTGG CTGCAGAGGG AAGACATCAG CAGGGAGGGA 60

GCCAGGGCCT GTCACATCTT TCCTCTGGCC ATTGTCCTGG TCTTTGTAAG CCCAGAATCT 120

CCCCTTCCCT GAAGGGAGGC CAGCACCCCA GGAGGGO.GC AGGTGTGCTG TGAGGGTTGG 180

AGTAGTGTGA GAGGTCAGGG TACACTAGAA TGGCCATGGA CACCATGTGG GGGTGCTCTG 240

GGCTGGGCCA CAGAACAGTG TCCTTCCTGC TGCTCCTCCC CTGCAGCTTC CCCCGACCTT 300

GTNGTTTATT TGGTTTGATA CCAATCAGCA GACCCTGCAA GGTGGAAGCT CCCAGGCTCT 360

CAGTCCCACS ACTCTCATGT GCCAGTCACC CMTACTGTAA CTGCCCAATG AGTACTTCTT 420

GCCCACTGCC AAGATAGAGC CAGTTTACCA AGACAGGGGA ATTGCAGTAG AGAAAGAGTT 480

GAATATACAT AGAGCCAGCT AAATGGGAGA GTGGAGTTTT CTTATTACTT AAATCAGCCT 540

CCCYTAAAAT TCAGAGGTGA GAATTTTTCA AGGACAGTTT GGTGGSCAGG CCTAGGGAAT 600

GGATGCTGCT GATTGGCTAG GGATGCAATC ATAGGGGTGT AGAAAAGTWC CTTCTGCACT 660

GAGTCCACTT TTGGTGAGAG CTACCAAGGA GCTGCTGGTC TGCTGGTCCC GGTAGAGCCA 720

TCTGGTGTCA GGAATGCAAA AGTG 744

(2) INFORMATION FOR SEQ ID NO: 94:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 525 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:

GCAGGGGAAT TCGGCCACGG AGGGGTTTCA ACAGGGCCCG TGGGGTGAGG TGCAP^ACACA 60
AAGCCCATAA GTGCTGGCCT GTTGGGACAA ATGAGAGAAA TCCCATAGGG TGGTGATGAC
AGCGCAYTCA GCCATCYTAY TCCTGGGGAA AATGAAACTT GTGCTCCTAT C-AATGCTCA 
GTTGTAAAAC TGGAAAAAAA TTTTAGAAGA CATCTTGTCC AGCATCTGTG TTTATGTCTA 
TAAAATGTAG AAAACTAAAG CACAGAGATG TTAAATGTTT TGTCCAAGGT CCAACAGCTG 
GTTAGCARGC TTGGTCTGGT GACCTTTCTA CTGAACCACA GTGCCGCTGG GGGAAGTCCT 
CAGCACAGAT GGCTGCTGCT ATAGCTGGGG TATGGGCAGT ATTAGTAGTT AAC-CAGTCAA 
CCCAAGTTCC CATAGTCTAG GTTCTGCTTC AGCTGGAGGT TAGGGAAAAA CACAAGAAAA 
TCCCTTACCA CTCTACCAGT GCTGGGGGAT GTACTAAGAG ATCCCC 

(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 425 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:
GGCACAGGGC AGGAGAGACT TGGTCCATGG GGAGAAGCCT GCAGTATAGA TGGGACCTCC 
AGGAGCCCAA GTAGCATAGA CCCTGCTGAT CCGGGGCCAT TGAGCCAGAG GATTTGGGCT 
GAATGTCCCC AGAGACAAAA GGGAAAGGTA GATCCTTTCC CTTAAAGATG AAAGCCATCG 
CCCGGGCTTG CTTATTGCTC TCTCTCCTGG TCCTTCCACA TGTTGTTTCT GAACATTTGT 
TCTGGCATCA CAATCCCCGT CATCCTGTCA TCTGGCCCTT CCC\CCTTTC CAC CTTATCT 
CTTGCAGTGT CTCCGCGTCG ACCTGGCACC TGGGTGAARG CTTGCTCTTG CTGGTGCCCA 
TAGCCCCCAG TGTATGGTOT TGAMCTCCCC AGCGATATGG APACCCACCT OVGGAGGGCC 
CCTCGA' 

(2) INFORMATION FOR SEQ ID NO: 96:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 844 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

GGCACAGCGG CACGAGATAG GAAGCTTGGC AGGGGCAGCT CCCCCAGTGC GCATTGCCCT 60

GTAACTCGAG CGCCTGGGAG TGGGGAGAGG CTTGGAAATG GAGCAGGGTG GTGGACCTCG 120

TCTTCTCCTG CTCATCCCAG GCCTCCTCCA TAACACCTAC CTAGCACGGC CTGGGGACTT 180

CCCAGCCCAA GGAACAACTG AGAATACTGA GTGCCAGCGT AGCCCTAGCC CCATTTCACA 240

CCTGGGCAAA GTGAGGTCAC TGGATTCAAA CACTCAGATT TAAACCTCCT CTGTGTCTGC 300

AGCACCTGTA TATAACTGCC AGCCTCTGCT GCCCCTCTCC AAAAAGTCTC TGCCCTTGTC 360

TTTGGCACCT GTCTCTGTCC TCCCCATTCT CTGCTCCTCC TTTCTCCAAC TCAGANTCAC 420

CCTGTTAGTT C-JGCAAATGT TCATCGAGCT CCATAATGTA GCAGGACAGG MCTGTCTAAC 480

AGATTCTGGN CTTGCAAGGG TGAGACAAGT ACTCTCCATC TTTCTCTCAT CTTCACAGAT 540

GGTCTGCTCA ACAACTTTGC ACTGAATTGT AAATAATTGA TACTGCATAA AACATTGATG 600

TTCTTTAAGG GTAGTCCACC AAGGTGGCAA GTCTTATAAT GATAACTGCT CAAGGATCTC 660

TCAGTGAAGC ATTTGGGGST GCTAGCTCTG CCTATGGGTG AGGTCAGCTA TCTCACGCCA 720

TCTACTTCCA CNTGCCCCCC CATGCCAGGG TCACCCTGAG CTGAGATGCC TGAGCAGGTG 780

GCAGAAAGGA GCGACCTGGT TTATGCTTCG GGACCACAAA CTCCTCTATC CAGAMGACAG 840

TTTT 844






(2) INFORMATION FOR SEQ ID NO: 97: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1985 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:

MCCCTGCTG AAGTACAGGT TCTTCTATCA GTTTCTGTTG GGCAATGAAC GAGOtACAGC 60 

AAAGGAGATC AGGGATGAAT ATGTGGAGAC GCTGAGCAAG ATTTACCTGT CTTACTACCG 120 

CTCTTACCTG GGGCGGCTCA TGAASGTGCA GTATGAGGAA GTCGCTGAGA AAGATGATCT 180

AATGGGTGTG GAAGATACAG CAAAGAAAGG ATTCTYCTCA AAGCCATCGC TCCGCAGCAG 240

GAACACCATT TTCACCCTAG GAACCCGCGG CTCTGTCATC TCCCCCACTG AACTTGAGGC 300

CCCCATCCTG GTGCCTCACA CAGCGCaGCG GNAGAGCAGA GGTATCCAIT TGAGGCCCTC 360

TTCCGCAGCC AGCACTACGS CCTCCTAGAC AATTCCTGCC GCGAATACCT TTTCATCTGT 420

GAATTTTTTG TTGTGTCTGG CCCAGYTGCA CACGACCTGT TCCATGCTGT CATGGGCCGT 480

ACACTCAGCA TGACCCTGAA AGACCTGGAT TCTTATCTAG CTGACTGCTA CGATGCCATT 540

GCTGTTTTTC TCTGTATCCA CATTGTTCTC CGGTTCCGTA ACATTGCAGC AAAGAGGGAT 600

GTTCCTGCCC TGGACAGGTA CTGGGGAACA GGTGCTTGCC TTGCTATGGC CACGGTTTGA 660

ACTGATCCTG GAGATGAATG TTCAGAGCGT CCGAAGCACT GACCCCCAGC GCCTAGGGGG 720

GTTGGATACT CGGCCCCACT ATATCACACG CCGCTATGCA GAGTTCTCCT CCGCTCTTGT 780

CAGTATCAAC CAGACAATTC CTAATGAACG GACCATGCAA TTGCTGGGAC AGCTGCAGGT 840

GGAGGTGGAG AATTTTGTCC TCCGAGTGGC AGCTGAGTTC TCCTCAAGGA AGGAGCAGCT 900 

TGTGTTTCTG ATCAACAACT ATGACATGAT GCTGGGTGTG CTGATGGAGC GGGCTGCAGA 960 

TGACAGCAAA GAGGTTGAGA GCTTCCAGCA GCTGCTCAAT GCTCGGflCAC AGGAATTCAT 1020

TGAAGAGTTG CTGTCTCCCC CTTTTGGGGG TTTAGTGGCA TTTGTGAAGG AGGCTGAGGC 1080

TTTGATTGAG CGTGGACAGG CTGAGCGACT TCGAGGGGAA GAAGCCCGGG TAACTCAGCT 1140

GATCCGTGGC TTTGGTAGTT CCTGGAAATC ATCAGTGGAA TCTCTGAGTC AGGATGTAAT 1200

GCGGAGTTTC ACCAACTTCA GAAATGGCAC CAGTATCATT CAGGGAGCGC TGACCCAGCT 1260

GATCCAGGTC TATCATCGCT TCCACCGGGT GCTGTCCCAG CCGCAGCTCC GAGCCCTCCC 1320

TGCCCGGGCT GAGCTCATCA ACATTCACCA CCTTATGGTG GAGCTGAAGA AGCATAAGCC 1380

CAACTTCTGA TGTGCCAGAA ACCGCCCTGA GATCTGCCGG TCATCTCCAT GGACTTGTGC 1440 

ACCCCATTCC ATACCCTTCT TCACCTGGGG TACCCCTTCC AGTTTTCCCC TTGCTTCCCA 1500 

GGCCCTTGAC ATGGCTTACC TGCGTTCACT CCCAGCACCT TGCCGAACAG GATAAGCTGG 1560

ATCCCCTTGG GCTTCTGAAT ATCCCAGTGT CTTCAGGTTT CCGAAGACCA CTTCCCTGTG 1620

GGCTTCCAAA ATGGCCTTTA TCATTTCTCC AGTCTGTCAC CCTCCTTTGC TGCTCCCATA 1680

CACCCAAGGC TTGTTTCTTC CCCTGTAAAA ACCACTGCCT CAATCTCTGG TTCACTGAAC 1740

TAGTCACCAT GTCCTGAGGC ATGAAGCCTC CTCAGCTCTT GGAATTGCTG GCAAGGGGTG 1800

ACTGCCTCTG AGTCATTGTG TTTTTCAAAG TGATTTCTTT TCTGTAGCTT TTTGACCTAA 1860

GATCTGAGCA ATTTGAACAC TAACCTCTCC CCTCCTGGCT CAAGAATTAC TCCGAACTCA 1920

GTCTGCAGAA AATAAATATT TAGTATGACA TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1980

AAAAA 1985






(2) INFORMATION FOR SEQ ID NO: 98:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1416 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

ATATGAAGGG AAAGAATTTG ATTATGTTTT CTCAATTGAT GTCAATGAAG GTGGACCATC 60

ATATAAATTG CCATATAATA CCAGTGATGA CCCTTGGTTA ACTGCATACA ACTTCTTACA 120

GAAGAATGAT TTGAATCCTA TGTTTCTGGA TCAAGTAGCT AAATTTATTA TTGATAACAC 180

AAAAGGTCAA ATGTTGGGAC TTGGGAATCC CAGCTTTTCA GATCCATTTA CAGGTGGTGG 240 

TCGGTATGTT CCGGGCTCTT CGGGATCTTC TAACACACTA CCCACAGCAG ATCCTTXTAC 300 

AjGGTGCTGGT CGTTATGTAC CAGGTTCTGC AAGTATGGGA ACTACCATGG CCGGAGTTGA 360 

TCCATTTACA GGGAATAGTG CCTACCGATC AGCTGCATCT AAAACAATGA ATATTTATTT 420

CCCTAAAAAA GAGGCTGTCA CATTTGACCA AGCAAACCCT ACACAAATAT TAGGTAAACT 480

GAAGGAACTT AATGGAACTG CACCTGAAGA GAAGAAGTTA ACTGAGGATG ACTTGATACT 540

TCTTGAGAAG ATACTGTCTC TAATATGTAA TAGTTCTTCA GAAAAACCCA GAGTCCAGCA 600

ACTTCAGATT TTGTGGAAAG CTATTAACTG TCCTGAAGAT ATTGTCTTTC CTGCACTTGA 660

CATTCTTCGG TTGTCAATTA AACACCCCAG TGTGAATGAG AACTTCTGCA ATGAAAAGGA 720

AGGGGCTCAG TTCAGCAGTC ATCTTATCAA TCTTCTGAAC CCTAAAGGAA AGCCAGCAAA 780

CCAGCTGCTT GCTCTCAGGA CTTTTTGCAA TTGTTTTGTT GGCCAGGCAG GACAAAAACT 840

CATGATGTCC CAGAGGGAAT CACTGATGTC CCATGCAATA GAACTGAAAT CAGGGAGCAA 900

TAAGAACATT CACATTGCTC TGGCTACATT GGCCCTGAAC TATTCTGTTT GTTTTCATAA 960

AGACCATAAC ATTGAAGGGA AAGCCCAATG TTTGTCACTA ATTAGCACAA TCTTGGAAGT 1020 

AGTACAAGAC CTAGAAGCCA CTTTTAGACT TCTTGTGGCT CTTGGAACAC TTATCAGTGA 1080 

TGATTCAAAT GCTGTACAAT TAGCCAAGTC TTTAGGTGTT GATTCTCAAA TAAAAAAGTA 1140 

TTCCTCAGTA TCAGAACCAG CTAAAGTAAG TGAATGCTGT AGATTTATCC TAAATTTGCT 1200

GTAGCAGTGG GGAAGAGGGA CGGATATTTT TAATTGATTA GTGTTTTTTT CCTCACATTT 1260

GACATGACTG ATAACAGATA ATTAAAAAAA GAGAATACGG TGGATTAAGT AAAATTTTAC 1320

ATCTTGTAAA GTGGTGGGGA GGGGAAACAG AAATAAAATT TTTGCACTGC TGAAAAAAAA 1380

AAAAAAAAAA AAAAGGAAAC TCGAGGGGGG GCCCGG 1416 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1935 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:

NTCTACCCTA ATCAAGATGG GGACATACTT CGCGACCAGG TTCTTCATGA ACATATCCAG 60 

AGATTGTCTA AAGTAGTGAC TGCAAATCAC AGAGCTCTTC AGATACCAGA GGTTTATCTT 120

CGAGAAGCAC CATGGCCATC TGCACAATCA GAAATCAGGA CAATAAGTGC TTATAAAACC 180

CCCCGGGACA AAGTGCAGTG CATCCTGAGA ATGTGCTCTA CGATTATGAA CCTCCTGAGC 240

CTGGCCAATG AGGACTCTGT CCCTGGAGCG GATGACTTTG TTCCTGTGTT GGTGTTTGTG 300

TTGATAAAGG CAAATCCACC CTGTTTGCTG TCTACTGTGC AGTATATCAjG TAGCTTTTAT 360

GCTAGCTGTC TGTCTGGAGA GGAGTCCTAT TGGTGGATGC AGTTCACAGC AGCAGTAGAA 420

TTCATTAAAA CCATCGATGA CCGAAAGTGA CCAAGACCAA GGCCCACCAA GGCAGCAGAC 480

TGTTAATCAG ACAAACAGAT CTCTGAGAAG GTGCATCAGC TGCTTTGAAG GCTGAAGATT 540

GTTTTGTATG ATACTGCACA GCATCAGGCA TTTTAAAGCA GATCTTTACT AAACAGGTTA 600 

• a TGAGCTAAC AAGCAGGTTC TCTCGTCTTT GGGCTCTTTC CTTTCTGAGT TGCA-TATTCT 660 

ATTTTCTTGT CCCCAAGTAG AGACTAGTAC TACAAAAAGG GACCACATTT TTCAAGTATT 720

TCTAAGTATA AAAAACAAAA CAAAAATCTC TTAGGAAATG TCTAGACCTC CATTCTTGGA 780

TTCCCTTTCT TTCCTTTTAT TTTAAAAAAG AACAGTACCC CTCTTTTAAG ATGCTGTCTT 840

ACATTAATCA GCATCTAATG GAAAGAAGGT ATGAGTTGCA CTGAGGATTA GAATAGTGGT 900

GCGTTAGTGG CATTATCTAT AAATACACTC ACCTAAATTG AAAGCTAAGA AGGAAATGTA 960

AATATAATAT ATATTTATAT TTGATGTAAT ATGGACATCT GCAGATTCTA ATAAACAAGG 1020

ACTATTGCTG ATAGTAGGCT GTGACATACT GTCTTGTGAA ATGGTTTCCT TGACAAAATT 1080

TAAGCTGAGC TTAAAAGCAA AAAAACAAAA AGTACACAGA AATATTTATT AAAATGTAAT 1140

ACAGTTTATT GAACTTTCTA GGTATGGAGT TTGATGGACA GGGCTGCCW TAATGAGTGT 1200

GAAGGTCACT AAGTCACTTA GACATCTCAC CGTGGAAGTT TGTGAGCCTG CATTAGGAGA 1260

TAGACTGATT ACCATACATG ACATAAAAAG GAACAGTGGA TAGCTCATAC TTTATGGTGG 1320

TTCTTCTCCT CCGAAATAAT ATACTGCAGA AATCCCAGAC AGAGCTCCTT ACAAACCTTT 1380

AATTGTAATA TATTTTTGAT GATTATTCAC ATTGAATGCA CAGACCAAGA ATTCAGTGAA 1440

TGTCATTTTT TAAAAAACTA ATTTGTATTG TCTGCTCTAG TGATACAAGT TTTACTAGTG 1500

ATAAACTATT TTAATCAACC ATACTATTCT TATGGAAAAA AATATCTATT TTGGCAGGTT 1560

TCTGTGCCTT TATTTCCCTC TTCTGAAAAA AAGTCTGTGT TTTCATAGTT TGGTTTGCAT 1620

TGTATATCAA TAATTAATCA GGAATGGGTT TTGGTGCCTG AAAAATTGGC CATGGAGGCA 1680

CACCAAAjGCT TCAAGCACAA GTCTTGTACA TGGCCCATCA CTGTCTGGTT TCACTTCGTG 1740

TGTTTCCTAA ACACATTTAG CTGCTTTTTT AACAAi.CTCA GCCCCATACT TGAGTCCCTT 1800

GTTGTTGGGA GCATTTCCAG GCATCTTTTA AGGGAACTGT GACAAACAGC CTCGGGCAGA 1860

TGAACACGGA GGCTCTCTGT TGTCTGTCTC TGAGATCTTT GTGTCTGGGA ATGCCTAAAG 1920

NTTTTGNTTT TTTTT 1935

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 599 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:
GAATTCGGCA CGAGCGTCCA CGCAGCCGCC GGCCGGCCAG C^CCAGGGC CCTGCATGCC 60 
AGGTCGTTGG AGGTGGCAGC GAGACATGCA CCCGGCCCGG AAGCTCCTCA GCCTCCTCTT 120
CCTCATCCTG ATGGGCACTG AACTCACTCA AGACTCCGCT GCCCCCGACT CCCTGCTGAG 180
AAGTTCAAAG GGCAGCACGA GGGGGTCTTT GGCTGCTATT GTCATCTGGA GGGGGAAGAG 240
TGAGA3CCGG ATAGCCAAGA CCCCAGGCAT TTTCAGAGGT GGCGGGACCT TAGTCCTACC 300
CCCAACACAC ACCCCTGAGT GGCTCATCCT CCCTTTGGGC ATAACGCTGC CCTVGGGOGC 360
TCCAGAAACA GGCGGTGGGG ATTGTGCCGC TGAGACCTGG AAGGGCAGCC AGCGTGCCGG 420
CCAGCTGTGT GCATTGCTGG CTTAATATGC AGGGCTTGGG GGGCTGTGGC CACATGCCCG 480
GCAGGAGGTG AGTGAGGAGC CCTGTGGCGT GCTGGTGTGG GGATCGTGGG C^TTTCAAAC 540
GGGCTTGTCG TACCCTGAAC AATGTATCAA TAGAGAAAAA AAAAAAAAAA AAAACTCGA 599






(2) INFORMATION FOR SEQ ID NO: 101:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 784 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:

GAATTCGGCA CAGAAAAAAA AGAGAGACTG GGTCTTACTG TGTTGCCCAG ACTTGTCTTG 60

AACTCCTGCC TCAGCCTCTC AAGTACTTGG GATTATAGGC CAAGAAGCCA CCATGCCTAG 120

CTTCTTCCTG TCATTGATCC AGACTAATAC TCTGGGGTCA GCCTCATTTC TTCTCTTTCT 180

CACTTTGCAC ATCCACTTGT CACCAAATCK RGTTCATTCT GCATCCTAAG TAAGTCCTTT 240

GATTCCTCCA GTTGTTCATT AGTAATGTCT CAARTGTAAT TTTTTCTAGT AGTTTTCAGC 300

CTGTCTTTCC KGCCTTCAGT CTTAACTTCT CCAGTACATA KGCCACATTG TTGTCAGCAK 360

GATCAWATTT TATTTAAAAA TACTTTACAW AKGTTTATKG CCAAATATTA GRAAATACAG 420

ATTCATGGAA AGAAAAATCA CTGTCCCAAG GAGGTCACTG GCATGGTGAG GTTAAGGGGT 480

GATTTTAATT TTTAAAAATG TATATTTTTT CCTGTGTAGA GTAGTAACAC CCTTGAAAAC 540

ACAWTCCCTT GTAAAGTCTC TAATTCTGTA CTCCGCATCT AG3TGRTCTC TTCTTTCTCA 600

GATATTTTAC AATTTCATTT ATGACCACCT TTCTCTAGCC TTTACCCGTC TCTTCAATAT 660

TWACATATGC AGAAGTTTCT CCTAACAAAC ACCTGCCTCT GCCTCAGTTC TGCTACCACC 720

CTGTTGCTTT CTTTCCCTTC ACAATCAAAT TTAAGAGTGT CAAAAAAAAA AAAAAAAAAC 780

TCGA 784






(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1035 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:

AGAGGCCTGG CTGCGTTGCC CTATCTCCGT CTCCGCCACC CACTTAGCGT TTTAGGCATC 60 

AATTACCAGC AGTTTCTCCG CCACTATCTG GAAAATTACC CGATTGCTCC CGGCAGAATA 120 

CAAGAGCTTG AAGAACGCCG CAGTTGCGTG GAAGCCTGCA GAGCAAGGGA AGCAGCGTTT 180

GATGCCGAAT ATCAGCGAAA TCCTCACAGG GTGGACCTCG ATATTTTAAC CTTTACGATA 240

GCTCTGACTG CCTCTGAAGT TATCAACCCT CTGATAGAAG AACTTGGTTG CGATAAGTTT 300

ATCAATAGAG AATAGTTAGG TGGTGACACT ACTTCAAGAG AACCTCTGCA TTCCAGTCAT 360

ACCAATCCTG CAACTTGATT TTCAGAAGTC AAGAGTATAT CGCGATAAGA CAGTGCACAG 420

GTGGAGGGGA AAAAAAGGGG GAGGGGGAAG CTTATCTTGA AAAAGCATCA CAGAAGTAGA 480

AAAAAATGTC GAAAGCATTA TAACTGTAAC GTTCTTTGAG TTTGTGATTG ATCCACATTT 540

TTCCCCCTGC ATTATGGAAA ATGTCTCTCA GCATTGCTTT ATTACAAAGT AAAGGATGGT 600

TTTATAAAAT TGAGACTGAT GAAACATCAA TACTAG AGO C CATGAGGATG AAAGAAATTA 660

TCAAATAGTG CTGAACAGAA TAAGATGTTA ACGCTGAGTT ATTAGGACTG GAAGGCTATG 720

AAAAGAACTT GAAATTGTCG GAATATGTGC TCTCTTCATG TCATATTCAA TAGAAGTTTC 780

TAGTTTAAGA TTGATTTTGT GTTTTCTTAG GCATTTCAAG TGACAAGCAA AGTAAATGTA 840

TATATTATGT G AT-- AATC AT GTTTTCAA.GA ACGTCAAATT TCTGGACTTT TTT.CTTTCAA 900 

TTTTTAATTT TTAAAGTTTT TTTGGTATTA AAAAATCYAT TCACAACOGA AAAAAJ1WTWT 960 

WAAATWTWQ4 GCGAAAAGCC AAAAAAAAAA AAAAMMftGGG GGGGCCGGGC CCCATCCCCC 1020 

CAAGGGGGTG CMGNT 1035

(2) INFORMATION FOR SEQ ID NO: 103:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2218 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:

AGGTATTAGG CCCTTTTGTG GGAGC CCCAT GTTTTGTTTT TCTGAGTTGG TGGGGAGGGA 60 

SGGAGGGGGA GGGCTGAATT GTTTTGCAGA GGAAGATGGC ATCTGTGCTT TAAATTTCTC 120 

ATTACTGGGT TAGAAAACAA AGAGGGAKTG CCCTGCACAT TTTCTTTTGT GCTTTTAAAT 180

GTTTCTTAAG TTGGAACAGG TTTCCTCGGG CCTGTTTTGA CTGATTGCTG GAGTGCATTT 240

GATAGTTAAA AATTACTAAT TGGTTTTATT TCCCTTCACA CTCTGCCTCC CCACTTCTCC 300

CCCCGTTACT GPiAAAATAAC CATTTTAGTG TCAGGCTAGA A-.TTGAVTTG CTGAGTTTTG ' 360 

TGTATCCTTT AAATT&AAAA C-CACAAGTGT TTATTGTAGT GGTTAAACTG TAGCATCTCA 420 

GCATCTGGGT GGAAGCTGCC TATATTTCTT CCCAGTTTAA CTGGGGACCA TCTGTGAAAT 480

TAATTTTCCA TCCAGACAGC TGCTGTGAGC AAATGAACAT AAATGCTCGC TGGAAATTTA 540

CTAACCAGTT TTTATATTGA CCTGCAGTGT AWAjGCACA TTTAATTATA AACAATATAT 600 


TOAAATGGG CAAATTTTAT TTTCAAATGC AGTGTAGAGC TAGATTAAAA GCAACTCTTT 660 

GCCACCTACT CTGCCCTTTT GGCAAAGTTA CCTTGAACAA AGAATCTTAA GGGTTTATTA 720

AGAACTCTTT ATTTTCTTCA TACCCTGTTC TCTGCAGTGC TTTCTAACAG CTTCTGGGTG 780

CAGATTTTCT TCGGCATCCT TTTGCACTCA GCTTATTACA GGTAGGTAGT GCTTAAGAAA 840

AGTCATGGAG GACTAPAGCC TAAGTCCTTT TCACTTTTCC TCCATCTGAA GGTAGGTGAG 900

TTCATCCTCT TCATAGTAAT GCTGTTTTAC CAAGACTTTA TAGCAGATGG ACCCAGAAAG 960 

AATTTTCTGC TATTGTGTTC ACTACAACAG GATAGGGACA TCAGACAGCC OCAGAAACCC 1020 

CTTCCAGATC TGATATGGGA CTATTAATTT TTATGCTGTT A-.TTGGTATT CATTCACAAT 1080

GCAGTTGAAG GGGGAAGGCT CCACTGCATT CTTTGGCTAA GGCCTGAATG CTTGCTCATC 1140

TGTAAGATCT ATACTCGAGG TTTTGTTTTC CTTTTAAAAT TCTTTAGGGA GAGAGGGATG 1200

GTTTCTGAGG GGTTCTGAAA GTATGATTCA ATGTGCAACA TACAGGTAGG TCTTCAGCAT 1260

AAGCTGAAAT ATATGCATGT AAAAACTTTG ACATCTTTTT TTTTAATTTT CCACTTTCTT 1320

CTTAACTTTA CTTCTCTTTT TGTCCCCCCC CCATCTTACA GAAGTTGAGG CCAAGGGAGA 1380

ATGGTAGGCA CAGAAGAAAC ATGGCAAACT GCTCTGTGCT TTCAAACCAA AGTGTTCCCC 1440 

CCAACCCCAA ATTTGTCTAA GCACTGGCCA GTCTGTTGTG GGCATTGTTT TCTACAACCA 1500 

AATTCTGGGT TTTTTTCTTC TTTCTTTAAA CATAGAGGTA CCACCACAAG GGATGCCCTA 1560

CTCTCTCGCA GCTCTTGAAA GCATCTGTTT GAGGGAAAGG TCTCTGGGCA AGCAAGTGGT 1620

TATTTGGATT GCTTGCTTCC CTTTTTCCAC CTGGGACATT GYAATCATAA AATAACAGTA 1680

AATTCCAAAC CTCAAAAACT ATTATGGCCT GAGCACAGCT GAAATCTAGC AGAGTTTAAC 1740

TCTTCTGCCT CCATGTCTGT CACTTATAAT TCAGGTTCTG CTGTTGGCTT CAGAACATGA 1800

GCAGAAGAAT CGTTTTATGC TAGTTATTGY ATTCATGGTT GAAACTCAAC TTAGGGAAAG 1860

GGTTCCAATG TATTAAGCAA TGGGCTGCTT CTCCCCAATC CTCCCTAACA ATTCGTTGTG 1920

TGGACTTCTC ATCTAAAAGG TTAGTGGCTT TTGCTTGGGA TCAGTGCTCT CTATTGATGT 1980

TCTTGCTGGT CTCCAGACAC ATTCCTGTTG CATTAAGACT TGAAAGACTT GTAGATGTGT 2040

GATGTTCAGG CACAGGATGC TGAAAGCTAT GTTACTATTC TTAGTTTGTA AATTGTCCTT 2100

TTGATACCAT CATCTTGTTT TCTTTTTGTA GGTATAAATA AAAACACTGT TGACAATAAA 2160

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 2218






(2) INFORMATION FOR SEQ ID NO: 104:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1351 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104:
CTTCACAGAC TGACAGAATG GTTTTGTTTT GTTTTGTTTT GTTTTGTTTT GTTTTTGAGA 60 
TGGACTCTAG CTCTGTCACC CAGGCTGGAG TGCAGTGGTG CGATCTCGGC TCACTGCAAG 120
CTCCGCCTCC CGGGTTCTCA CCATTCTCCT GCCTCAGCCT CCCGAGTAGC TGGGACTACA 180
GGCGCCC1.CC ACCACGCCCG GCTAATTTTT TGTATTTTTT AGTAGAGACG GGGTTTCACC 240

ATGTTAGCCA GGATGGTCTC GATCTCCTGA CCTCGTGATC CGCCCGCYTC GGCCTCCCAA 300

AGTGCTGGGA TTACAGGCGT GAGCCACCGT GCCTGCCCCA GAATGGTTTT TAAAGCCACA 360

GTTGAGARGC CACCCATTGC CCGGCGCCTG GACAGTGATC ATCTTGTTCA TCTTGTTCAG 420

TGGTTTCTTG TGTGATTGGA ATTATTCATC CCCTTTGAAA GATGAGAAGG TTGAGATGCA 480

AAGACTCTAC CTTTCGAAGT TCTCACTGGT GGAAAGARCT AGAAGCACAG TTCAAAGTTC 540

TGGNTTCTGG ACTCTGCAGT CCAGGTYTCC CTTYTCCCAC TTGCCTACCC TCAATGCCAG 600

ACTGTTTTTG AAGTGGOCCA TAACTTGAAG GRAAAGTTTA AAGACAGTTC AATTTAATCA 660

TCAGRATGCA TTCTTTTTTT TTTCGGARAC GGAKTTTCAC TCTTGCTGCC CASGCTGGAG 720

TGCAATGG7G CAATGATGTC GGCTCACTGC AAGCTATGCC TCCTGGGTTC AAGNGATTAT 780

CCAGGCTCAG CCTCCCGAGT AGCTGGGATT ATGGGCGGCC ACCACCATGC CCAGCTAATT 840


TTTGTATTTT TTTTTTTAGT AGAGATGGGG TTTCGCCAGG TTGGGCAGGG TGKTGTTGTG 900 

AAYTCCTGGC YTCAGGTGAT YTGCCGACYT CATCYTCCAA AAGTGCTGGG ATTACAGGCA 960 

TGAGCCACTG CGCCTGGCYT CAGAATGCAT TCTTACACAT CTATCGTAGA CATTTATAAG 1020
CACTCTAATG GATAACAATC CAAGAATAAA TGATTGTAAA AGATGATGGC GAAGAGTTGA 1080

TGTCAATGTT TTTTTCCTAA GAAAAAAAGT CCGCGAGTAT TAAATATTTA GATCAATGTT 1140

TATAAAATGA TTACTTTGTA TATCTCATTA TTCCTATTTT GGAATAAAAA CTGACCTTCT 1200

TTAATGATAT ACTTGTCTTT TGTAAATAGC AGGTTTTGTG TCATTCTCCC CACTTTATTA 1260

GTTAATTTAA ATTGGAAAAA ACCCTGAAAC TAATATTGTT GTCTGTTCCA GTCTTATAAA 1320

TAAAACTTAT AATGCATGTA AAAAAAAAAA A 1351

(2) INFORMATION FOR SEQ ID NO: 105:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2066 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:



GGCACGAGGC GGCGGAGGGG C^CAATCACA GCTCCGGGCA TTGGGGGAAC CCG^J^CCGGC 60

TGCGCCGGGG GAATCCGTGC GGGCCCCTTC CGTCCCGGTC CCATCCTCGC CGCGCTCCAG 120

CACCTCTGAA GTTTTGCAGC GCCCNGAAAG GAGGCGAGGA AGGAGGGAGT GTGTGAGAGG 180

AGGGAGCAAA AAGCTCACCC TAAAACATTT ATTTCAAGGA GAAAAGAAAA AGGGOGGGCG 240

CAAAAATGGC TGGGGCAATT ATAGAAAACA TGAGCACCAA GAACCTGTGC ATTGTTGGTG 300

GGATTCTGCT CGTGTTCCAA ATCATCGCCT TTCTGGTGGG AGGCTTGATT GCTCCAGGGC 360

CCACAACGGC AGTGTCCTAC ATGTCGGTGA AATGTGTGGA TGCCCGTAAG AACCATCACA 420

AGACAAAATG GTTCGTGCCT TGGGGACCCA ATCATTGTGA CAAGATGCGA GACATTGAAG 480

AGGCAATTCG AAGGGAAATT GAAGGCAATG ACATCGTGTT TTCTGTTGAC ATTCCCCTCC 540 

CCCACATGGA GATGAGTCCT TGGTTCCAAT TCATGCTGTT TATCCTGCAG CTGGACATTG 600 

CCTTCAAGCT AAACAACCAA ATCAGAGAAA ATGCAGAAG7 CTCCATGGAC GTTTCCCTGG 660 

CTTACCGTGA TGAGGCATTT GCTGAGTGGA CTGAAATGGC CCATGAAAGA GTACCACGGA 720

AACTCAAATG CACCTTCACA TCTCCCAAGA CTCCAGAGCA TGAGGGCCGT TACTATGAAT 780

GTGATGTCCT TCCTTTCATG GAAATTGGGT CTGTGGCCCA TAAGTTTTAC CTTTTAAACA 840

TCCGGCTGCC TGTGAATGAG AAGAAGAAAA TCAATGTGGG AATTGGGGAG ATAAAGGATA 900

TCCGGTTGGT GGGGATCCAC CAAAATGGAG GCTTCACCAA GGTGTGGTTT GCCATGAAGA 960

CCTTCCTTAC GCCCAGCATC TTCATCATTA TGGTGTGGTA TTGGAGGAGG ATCACCATGA 1020

TGTCCCGACC CCCAGTGCTT CTGGAAAAAG TCATCTTTGC CCTTGGGATT TCCATGACCT 1080

TTATCAATAT CCCAGTGGAA TGGTTTTCCA TCGGGTTTGA CTGGACCTGG ATGCTGCTGT 1140
TTGGTGACAT CCGACAGGGC ATCTTCTATG CGATGCTTCT GTCCTTCTGG ATCATCTTCT 1200

GTGGCGAGCA CATGATGGAT CAGCACGAGC GGAACCACAT TGCAGGGTAT TGGAAGCAAG 1260

TCGGACCCAT TGCGGTTGGC TCCTTCTGCC TCTTCATATT TGACATGTGT GAGAGAGGGG 1320

TACAACTCAC GAATCCCTTC TACAGTATCT GGACTACAGA CATTGGAACA GAGCTGGCCA 1380

TGGCCTTGAT CATCGTGGCT GGAATCTGCC TCTGCCTCTA CTTCCTGTTT CTATGCTTCA 1440 

TGGTATTTCA GGTGTTTCGG AACATCAGTG GGAAGCAGTC CAGCCTGCCA GGTATGAGCA 1500 

AAGTCCGGCG GCTACACTAT GAGGGGCTAA TTTTTAGGTT CAAGTTCCTC ATGCTTATCA 1560

CCTTGGCCTG CGGTGCCATG ACTGTCATCT TCTTCATCGT TAGTCAGGTA ACGGAAGGCC 1620

ATTGGAAATG GGGCGGCGTG ACAGTCGAAG TGAACAGTGC CTTTTTCACA GGCATCTATG 1680
GGATGTGGAA TCTGTATGTC TTTGCTCTGA TGTTCTTGTA TGCACCATCC CATAAAAACT 1740

ATGGAGAAGA CCAGTCCAAT GGAATGCAAC TCCCATGTAA ATCGAGGGAA GATTGTGGTT 1800

TGTTTGTTTC GGAACTTTAT CAAGAATTGT TCAGCGCTTC GAAATATTCC TTCATCAATG 1860

ACAACGCAGC TTCTGGTATT TGAGTCAACA AGGCAACACA TGTTTATCAG CTTTGCATTT 1920

GCAGTTGTCA CAGTCACATT GATTGTACTT GTATACGCAC ACAAATACAC TCATTTAGCC 1980

TTTATCTCAA AATGTTAAAT ATAAGGAAAA AAGGGTCAAC AATAAATATT CTTGAGTATA 2040

AAAAAAAAAA AAAAAAAAAA AAAAAA 2066

(2) INFORMATION FOR SEQ ID NO: 106:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1705 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

AATTCGGCAK AGGGCAGCTG TCGGCTGGAA GGAACTGGTC TGCTCACACT TGCTGGCTTG 60

CGCATCAGGA CTGGCTTTAT CTCCTGACTC ACGGTGCAAA GGTGCACTCT GCGAACGTTA 120

AGTCCGTCCC CAGCGCTTGG AATCCTACGG CCCCCACAGC CGGATCCCCT CAGCCTTCCA 180

GGTCCTCAAC TCCCGYGGAC GCTGAACAAT GGCCTCCATG GGGCTACAGG TAATGGGCAT 240 

CGCGCTGGCC GTCCTGGGCT GGCTGGCCGT OVTGCTGTGC TGCGCGCTGC CCATGTGGCG 300 

CGTGACGGCC TTCATCGGCA GCAACATTGT CACCTCGCAG ACCATCTGGG AGGGCCTATG 360

GATGAACTGC GTGGTGCAGA GCACCGGCCA GATGCAGTGC AAGGTGTACG ACTCGCTGCT 420

GGCACTGCCG CAGGACCTGC AGGCGGCCCG CGCCCTCGTC ATCATCAGCA TCATCGTGGC 480

TGCTCTGGGC GTGCTGCTGT CCGTGGTGGG GGGCAAGTGT ACCAACTGCC TGGAGGATGA 540

AAGCGCCAAG GCCAAGACCA TGATCGTGGC GGGCGTGGTG TTCCTGTTGG CCGGCCTTAT 600

GGTGATAGTG CCGGTGTCCT GGACGGCCCA CAACATCATC CAAGACTTCT ACAATCCGCT 660

GGTGGCCTCC GGGCAGAAGC GGGAGATGGG TGCCTCGCTC TACGTCGGCT GGGCCGCCTC 720

CGGNCTGCTG CTCCTTGGCG GGGGGCTCCT TTGCTGCAAC TGTCCACCCC GCACAGACAA 780

GCCTTACTCC GCCAAGTATT CTGCTGCCCG CTCTGCTGCT GCCAGCAACT ACGTGTAAGG 840

TGCCACGGCT CCACTCTGTT CCTCTCTGCT TTGTTCTTCC CTGGACTGAG CTCAGCGCAG 900

GCTGTGACCC CAGGAGGGCC CTGCCACGGG CCACTGGCTG CTGGGGACTG GGGACTGGGC 960

AGAGACTGAG CCAGGCAGGA AGGCAGCAGC CTTCAGCCTC TCTGGCCCAC TCGGACAACT 1020

TCCCAAGGCC GCCTCCTGCT AGCAAGAACA GAGTCCACCC TCCTCTGGAT ATTGGGGAGG 1080 


GACGGAAGTG ACAGGGTGTG GTGGTGGAGT GGGGAGCTGG CTTCTGCTGG CCAGGATGGC 1140 

TTAACCCTGA CTTTGGGATC TGCCTGCATC GGTGTTGGCC ACTGTCCCCA TTTACATTTT 1200 

CCCCACTCTG TCTGCCTGCA TCTCCTCTGT TGCGGGTAGG CCTTGATATC ACCTCTGGGA 1260

CTGTGCCTTG CTCACCGAAA CCCGCGCCCA GGAGTATGGC TGAGGCCTTG CCC-JCCCACC 1320 

TGCCTGGGAA GTGCAGAGTG GATGGACGGG TTTAGAGGGG AGGGGCGAAG GTGCTGTAAA 1380

CAGGTTTGGG CAGTGGTGGG GGAGGGGGCC AGAGAGGCGG CTCAGGTTGC CCAGCTCTGT 1440

GCCCTCAGGA CTCTCTGCCT CACCCGCTTC AGCCCAGGGC CCCTGGAGAC TGATCCCCTC 1500

TGAGTCCTCT GCCCCTTCCA AGGAOiCTAA TGAGCCTGGG AGGGTGGCAG GGAGGAGGGG 1560

ACAGCTTCAC CCTTGGAAGT CCTGGGGTTT TTCCTCTTCC TTCTTTGTGG TTTCTGTTTT 1620

GTAATTTAAG AAGAGCTATT CATCACTGTA ATTATTATTA TTTTCTACAA TAAATGGGAC 1680

CTGTGCACAG GRAAAAAAAA AAAAG 1705 




(2) INFORMATION FOR SEQ ID NO: 107:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1157 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:

TGCAGGAATT CGGCAGAGGT TTTCCGCTAG ACTCTGGCAG TTGGTGAGCA TCATGGCAAC 60

CGTTACAGCC ACAACCAAAG TCCCGGAGAT CCGTGATGTA ACAAGGATTG AGCGAATCGG 120

TGCCCACTCC CACATCCGGG GACTGGGGCT GGACGATGCC TTGGAGCCTC GGCAGGCTTC 180

GCAAGGCATG GTGGGTCAGC TGGCGGCACG GCGGGCGGCT GGCGTGGTGC TGGAGATGAT 240 

CCGGGAAGGG AAGATTGCCG GTCGGGCAGT CCTTATTGCT GGCC+\GCCGG GCACGGGGAA 300 

GACGGCCATC GCCATGGGCA TGGCGCAGGC CCTGGGCCCT GACACGCCAT TCACAGCCAT 360

CGCCGGCAGT GAAATCTTCT CCCTGGAGAT GAGCAAGACC GAGGCGCTGA CGCAGGCCTT 420

CCGGCGGTCC ATCGGCGTTC GCATCAAGGA GGAGACGGAG ATCATCGAAG GGGAGGTGGT 480

GGAGATCCAG ATTGATCGAC CAGCAACAGG GACGGGCTCC AAGGTGGGCA AACTGACCCT 540

CAAGACCACA GAGATGGAGA CCATCTACGA CCTGGGCACC AAGATGATTG AKTCCCTGAC 600

CAAGGACAAG GTCCAGGCCG GGGACGTGAT CACCATCGAC AAGGCGACGG GCAAGATCTC 660

CAAGCTGGGC CGCTCCTTCA CACGCGCCCG CGAACTACGA CGCTATGGGC TCCCAGACCA 720

AGTTCGTGCA GTGCCCAGAT GGGGAGCTCC AGAAACGCAA GGAGGTGGTG CACACCGTGT 780

CCCTGCACGA GATCGACGTC ATCAACTCTC GCACCCAGGG CTTCCTGGCG CTCTTCTCAG 840

GTGACACAGG GGAGATCCAG TCAGAAGTCC GTGAGCAGAT CAATGCCAAG GTGGCTGAGT 900

GGCGCGAGGA GGGCAAGGCG GAGATCATCC CTGGAGTGCT GTTCATCGAC GAGGTCCACA 960

TGCTGGACAT CGAGAGCTTC TCCTTCCTCA ACCGGGCCCT GGAGAGTGAC ATGGCGCCTG 1020

TCCAGCAGGT CTATGGGGAT GCCGTGAGGG CTCTGGTAGC TGGTGCCCCG GATTCGCGTG 1080

ATCCCACGGT TGGTGGCCTC GTGCCGAATT CCTGCAGCCC GGGGGATCCA CTAGTTCTAG 1140
AGCGGCCGCC ACCGCGGTGG ANCTCCN 1157 

(2) INFORMATION FOR SEQ ID NO: 108:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1907 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:



GGCACAGGGG AATCATCGTG TGATGTGTGT GCTGCCTTTG TGAGTGTGTG GAGTCCTGCT 60

CAGGTGTTAG GTACAGTGTG TTTGATCGTG GTGGCTTGAG GGGAACCCTT GTTCAGAGCT 120

GTGACTGCGG CTGCACTCAG AGAAGCTGCC CTTGGCTGCT CGTAGCGCCG GGCCTTCTCT 180

CCTCGTCATC ATCCAGAGCA GCCAGTGTCC GGGAGGCAGA AGGTACCGGG GCAGCTACTG 240

GAGGACTGTG CGGGCCTGCC TGGGCTGCCC CCTCCGCCGT GGGGCCCTGT TGCTGCTGTC 300

CATCTATTTC TACTACTCCC TCCCAAATGC GGTCGGCCCG CCCTTCACTT GGATGCTTGC 360

CCTCCTGGGC CTCTCGCAGG CACTGAACAT CCTCCTGGGC CTCAAGGGCC TGGCCCGAGC 420 

TGAGATCTCT GCAGTGTGTG AAAAAGGGAA TTTCAACGTG GCCCATGGGC TGGCATGGTC 480

ATATTACATC GGATATCTGC GGCTGATCCT GCGAGAGCTC CAGGCCCGGA TTCGAACTTA 540

CAATCAGCAT TACAACAACC TGCTACGGGG TGCAGTGAGC CAGCGGCTGT ATATTCTCCT 600

CCCATTGGAC TGTGGGGTGC CTGATAACCT GAGTATGGCT GACCCGAACA TTCGCTTCCT 660

GGATAAACTG CCCCAGCAGA CCGGTGACCG TGCTGGCATC AAGGATCGGG TTTACAGCAA 720

CAGCATCTAT GAGCTTCTGG AGAACGGGCA GCGGGCGGGC ACCTGTGTCC TGGAGTACGC 780

CACCCCCTTG CAGACTTTGT TTGCCATGTC ACAATACAGT CAAGCTGGCT TTAGCGGGGA 840

GGATAGGCTT GAGCAGGCCA AACTCTTCTG CCGGACACTT GAGGACATCC TGGCAGATGC 900
CCCTGAGTCT CAGAACAACT GCCGCCTCA? TGCCTACCAG GAACCTGCAG ATGACAGCAG 960

CTTCTCGCTG TCCCAGGAGG TTCTCCGGCA CCTGCGGCAG GAGGAAAA.GG AAGAGGTTAC 1020

TGTGGGCAGC TTGAAGACCT CAGCGGTGCC CAGTACCTCC ACGATGTCCC AAGAGCCTGA 1080

GCTCCTCATC AGTGGAATGG AAAAGCCCCT CCCTCTCCGC ACGGATTTCT CTTGAGACCC 1140

AGGGTCACCA GGCCAGAGCC TCCAGTGGTC TCCAAGCCTC TGGACTGGGG GCTCTCTTCA 1200

GTGGCTGAAT GTCCAGCAGA GCTATTTCCT TCCACAGGGG GCCTTGCAGG GAAGGGTCCA 1260

GGACTTGACA TCTTAAGATG CGTCTTGTCC CCTTGGGCCA GTCATTTCCC CTCTCTGAGC 1320

CTCGGTGTCT TCAACCTGTG AAATGGGATC ATAATCACTG CCTTACCTCC CTCACGGTTG 1380

TTGTGAGGAC TGAGTGTGTG GAAGTTTTTC ATAAACTTTG GATGCTAGTG TACTTAGGGG 1440 

GTGTGCCAGG TGTCTTTCAT GGGGCCTTCC AGACCCACTC CCCACCCTTC TCCCCTTCCT 1500 

TTGCCCGGGG ACGCCGAACT CTCTCAATGG TATCAACAGG CTCCTTCGCC CTCTGGCTCC 1560

TGGTCATGTT CCATTATTGG GGAGCCCCAG CAGAAGAATG GAGAGGAGGA GGAGGGTGAG 1620

TTTGGGGTAT TGAATCCCCC GGCTCCCACC CTGCAGCATC AAGGTTGCTA TGGACTCTCC 1680

TGCCGGGCAA CTCTTGCGTA ATCATGAGTA TCTCTAGGAT TCTGGCACCA CTTCCTTCCC 1740

TGGCCCCTTA AGCCTAGCTG TGTATCGGCA CCGCCACCCC ACTAGAGTAC TCCCTCTCAC 1800

TTGCGGTTTC CTTATACTCC ACCCCTTTCT CAACGGTCCT TTTTTAAAGC ACATCTCAGA 1860

TTAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAGGG CGGCCGC 1907 






(2) INFORMATION FOR SEQ ID NO: 109:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 611 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:

ATGAATTAAC GCCAAGCTNT NAATAGGGAC TCACTATGGG GGAAAGNTGG GTAACGCCTG 60 

CAGGTACCGT TCCGGAATTC CCGGGTCGAC CCACGCGTCC GATGGGGCTT TAGTAAATCA 120

GGCTTGCAGG CTCAAAGCTG CAATCTGCCC AGTCTCAGGT ACTGAGACTT TGTGGGCCTC 180

AGACACCAGG AAGAAAGTTG GGATACAGTC ATTTGAGTTA AAAAGGGAAT GACCCCTCAG 240

AAACCCGCAT TAGCAGTGTT ACTCTTGGAA GTGCCTTTAC TTTTAACGCT CTCTGTTCTG 300

AAAAA3AGGT GTTTGGTTAC GTGTGAGCCA ACATCACGTT TTGTTAGCTG TGATTTACCT 360
TTGTCCGTTT AAAAGACTTC ACGGAGCCAT TCTGTATACA AGGTGTGCTC TTTCCAATGT 420

AGAAGGGGTT ATGGAAAAGG GTGCGATCCT TTGCTGTAAA CTGGAGAGAC CAGTCCCAAA 480

CAGAGGGGAA TTTTAAGCCC TTCTCATCAC CCAATTGGAT GTTTTTGCTT ATACCAAATT 540

CCTGCAAAAT AAATAAATAA ATATTTGCAA AAGTAAAAAA AAAAAAAAAA AAAAAAAAAA 600

GGGGGGNCCN C 611



(2) INFORMATION FOR SEQ ID NO: 110:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2632 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:

TCCCAGCTCT CAGGACAAGG GCCCTGGGCG ATCTTTTAAA AAAGCCGATT GGGTGTCTTT 60

CTAAAAOTAC AACCAGTACT TCATCGTCAA GTTTCTGGGA AGGGAGTCCC CTCCAGATTC 120

TCATGGAGTG ACAAATCTTG ACTCTTGCTC CTGGAATTTT TCAGGCCCAA ACTAGCGTTT 180
CTACAATGAT TTATTTGGCA AATTTGTCTT GATTATGGGT GGCTGATGAG GAACGTGCTT 240

TTGTTAGGAA CCGAAACTGG GCGGCGGTGA GGGCGTGTAC GCAATGAGTC CGGAAGAGGG 300

TGAAATGCTT TCGGTAGGCA CTCCACGGCT GTGAAGATGG CGGCGGCTGC GTGGCTTCAG 360

GTGTTGCCTG TCATTCTTCT GCTTCTGGGA GCTCACCCGT CACCACTGTC GTTTTTCAGT 420

GCGGGACCGG CAACCGTAGC TGCTGCCGAC CGGTCCAAAT GGCACATTCC GATACCGTCG 480

GGGAAAAATT ATTTTAGTTT TGGAAAGATC CTCTTCAGAA ATACCACTAT CTTCCTGAAG 540

TTTGATGGAG AACCTTGTGA CCTGTCTTTG AATATAACCT GGTATCTGAA AAGCGCTGAT 600

TGTTACAATG AAATCTATAA CTTCAAGGCA GAAGAAGTAG AGTTGTATTT GGAAAAACTT 660

AAGGAAAAAA GAGGCTTGTC TGGGAAATAT CAAACATCAT CAAAATTGTT CCAGAACTGC 720

AGTGAACTCT TTAAAACACA GACCTTTTCT GGAGATTTTA TGCATCGACT GCCTCTTTTA 780

GGAGAAAAAC AGGAGGCTAA GGAGAATGGA ACAAACCTTA CCTTTATTGG AGACAAAACC 840 

GCAATGCATG AACCATTGCA AACTTGGCAA GATGCACCAT ACATTTTTAT TGTACATATT 900 

GGCATTTCAT CCTCAAAGGA ATCATCAAAA GAAAATTCAC TGAGTAATCT TTTTACCATG 960

ACTGTTGAAG TGAAGGGTCC CTATGAATAC CTCACACTTG AAGACTATCC CTTGATGATT 1020

TTTTTCATGG TGATGTGTAT TGTATATGTC CTGTTTGGTG TTCTGTGGCT GGCATGGTCT 1080

GCCTGCTACT GGAGAGATCT CCTGAGAATT CAGTTTTGGA TTGGTGCTGT CATCTTCCTG 1140

GGAATGCTTG AGAAAGCTGT CTTCTATGCG GAATTTCAGA ATATCCGATA CAAAGGARAA 1200

TCTGTCCAGG GTGCTTTGAT CCTTGCAGAR CTGCTTTCAG CAGTGAAACG CTCACTGGCT 1260

CGAACOCTGG TCVTCATAGT CAGTCTGGGA TATGGCATCG TCAAGCCACG CCTGGAGTCA 1320

CTCTTCATAA GGTTGTAGTA GCAGKAGCCC TCTATCTTTT GTTCTCTGGC ATGGAAGGGG 1380

TCCTCAGAGT TACTGGGGCC CAGACTGATC TTGCTTCCTT GGCCTTTATC CCCTTGGCTT 1440

TCCTAGACAC TGCCTTGTGC TGGTGGATAT TTATTAGCCT GACTCAAACA ATGAAGCTAT 1500


TAAAACTTCG GAGGAACATT GTAAAACTCT CTTTGTATCG GCATTTCACC AACACGCTTA 1560
TTTTGGCAGT GGCAGCATCC ATTGTGTTTA TCATCTGGAC AACCATGAAG TTCAGAATAG 1620
TGACATGTCA GTCCGACTGG CGGGAGCTGT GGGTAGACGA TGCCATCTGG CGCTTGCTGT 1680
TCTCCATGAT CCTCTTTGTC ATCATGGTTC TCTGGCGACG ATCTGCAAAC AACCAGAGGT 1740
TTGCCTTTTC ACCATTGTCT GAGGAAGAGG AGGAGGATGA ACAAAAGGAG CCTATGGTGA 1800
AAQAAAGCTT TGAAGGAATG AAAATGAGAA GTACCAAACA AJGAACCCAAT" GGAAATAGTA 1860
AAGTTAACAA AGCACAGGAA GATGATTTGA AGTGGGTAGA AGAGAATGTT CCTTCTTCTG 1920
TGACAGATGT AGCACTTCCA GCCCTTCTGG ATTCAGATGA GGAACGAATG ATCACACACT 1980
TTGAAAGGTC CAAAATGGAG TAAGGAATGG GAAGATTTGG AGTTAAAGAT GGCTACCATC 2040
AGGGAAGAGA TCAGCATCTG TGTCAGTCTT CTGTACGGCT CCATGGGATT AAAGGAAGCA 2100
ATGAGATCCT GATCTGTTCC TTGATCTTTG GGGATTGGAG TTGGGGAGAG GTGTCAGAAC 2160
AAAGAGAACA TCTTACTGAA AACAAGTTCA TAAGATGAGA AAAATGTACG AGCTTCTTAT 2220
TTACAACACT GGTGGCGCCT TTCCTCCCAG ACTCTGACAT GGATGTTCAT GCAACTTAAG 2280
TGTGTTGTTC CTGAACTTTC TGTAATGTTT GATTTTTTAA ATCTGACAAA CTAAAAAGTT 2340
TAACGTCTTC TAAAAGATTG TCATCAACAC CATAATATGT AATCTCCAGG AGCAACTGCC 2400
TGTAATTTTT ATTTATTTAG GGAGTTACAT AGGTGATGGG GGAAATTGTT AACTACCTTT 2460
CATTTTCCTG GGAAGTCAAG GTTACATCTT GCAGAGGTTG TTTTGAGAAA AAAGGGCCCT 2520
TCTGAGTTAA GGAGGCATAG TTCTATCAAT GATCAAAAGA AAAAAAAAAA AAGTCGATCG 2580
GCACGAGGGG GGGCCCGGTA CCCAATTCGC CCTATGGGAN TGGAATGAGA CC 2632

(2) INFORMATION FOR SEQ ID NO: 111:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2249 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:

GAATTCGGCA CGAGCTCACC GTGCTGCGTG ACA.CAAGGCC AGCCTGCGCC TACGAGCCCA 

TGGACTTTKT RATGGCCCTC ATCTACGACA TGGTACTGSW TGTGGTCACC " CTGGGGCTGG 

CCCTCTTCAC TCTGTGCGGC AAGTTCAAGA GGTGGAAGCT GAACGGGGCC TTCCTCCTCA 

' TCACAGCCTT CCTCTCTGTG CTCATCTGGG TGGCCTGGAT GACCATGTAC CTCTTCGG.CA 

ATGTCAAGCT GCAGCAGGGG GATGCCTGGA ACGACCCCAC CTTGGCCATC ACGCTGGCGG 
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CCAGCGCTGG GTCTTCGTCA TCTTCCACGC CATCCCTGAG ATCCACTGCA CCCTTCTGCC 360 

AGGCCTGCAG GAGAACACGC CCAACTACTT CGACACGTCG CAGCCCAGGA TGCGGGAGAC 420 

5 

GGCCTTCGAG GAGGACGTGC AGCTGCCGCG GGCCTATATG GAGAA.CAAGG CCTTCTCCAT . 430 

GGATGAAGAC AATGCAGCTC TGCG.iAC.AGG AGGATTTCCC AACGGCAGCT TGGGAAAAAG 540 

10 ACCCAGTGGG AGGTTGGGGA AAAGACCCAG CGCTCCGTTT AGAAGCAACG TGTATCACCC 6Q0 

AACTGAGATG GCCGTCGTGC TCAACGGTGG GACGATCCCA ACTGCTCCGC . CAAGTGACAC 660 

AGGAAGAMAC CTTTGGTGAA AGACTTTAAG TTCCAGAGAA TCAGAATTTC TCTTACCGAT 720 
TTGCCTCGCT GGCTGTGTCT TTCTTGAGGG AGAAATGGGT AACAGTTGCC GAACCAGGCC 780

GCCTCACAGC CAGGAAATTT GGAAATCCTA GCCAAGGGGA TTTCGTGTAA ATGTGAACAC 840

TGACGAACTG AAAAGGTAAC ACCGACTGCC OGGCCCTCCC CTGCCAiCACA CACAGACACG 900

TAATACCAGA CCAACCTCAA TCCCCG2AAA CTAAAGCAAA GGTAATTGCA AATAGTATTA 960 

GGCTCACTGG AAAATGTGGC TGGGAAGACT GTTTCATCCT CTGGGGGTAG AACAGAACCA 1020 

AATTCACAGC TGGTGGGCCA GACTGGTGTT GGTTGGAGGT GGGGGGCTCC CACTCTTATC 1080

ACCTCTCCCC AGCAAGTGCT GGACCCCAGG TAGCCTCTTG GAGATGACCG TTGGGTTGAG 1140 

GACAAATGGG GACTTTGCCA CCGGCTTTGC CTGGTGGTTT GCACATTTCA GGGGGGTCAG 1200

GAGAGTTAAG GAGGTTGTGG GTGGGATTCC AAGGTGAGGC CCAACTGAAT CGTGGGGTGA 1260

GCTTTATAGG CAGTAGAGGT GGAGGGACCC TGGCATGTGC CAAAGAAGAG GCCCTCTGGG 1320 

TGATGAAGTG ACCATCACAT TTGGAAAGTG ATCAACCACT GTTCCTTCTA TGGGGCTCTT 1380

GCTCTAGTGT CTATGGTGAG AACACAGGCG CCGGCCCTTC CGTTGTAGAG CCATAGAAAT 1440 

ATTCTGGCTT GGGGCAGCAG TCCCTTCTTC CCTTGATCAT CTCGCCCTGT TCCTACACTT 1500

ACGGGTGTAT CTCCAAATCC TCTCCCAATT TTATTCCCTT ATTCATTTCA AGAGCTCCAA 1560

TGGGGTCTCC AGGTGAAAMS CCCTCCGGGA GGCAGGTTGG AAJGGCAGGCA CCACGGGAGG 1620

TTTTCCGCGA TGATGTCACC TAGGAGGGGT TCAGGGGTTC CCACTAGGAT GCAGAGATGA 1680
CCTCTCGCTG CCTCACAAGC AGTGACACCT CGGGTCCTTT CCGTTGCTAT GGTGAAAATT 1740

CGTGGATGGA ATGGATCACA TGAGGGTTTC TTGTTGCTTT TGGAGGGTGT GGGGGATATT 1800

TTGTTTTGGT TTTTCTGCAG GTTCCATGAA AACAGGCCTT TTCCAAGCCC ATTGTTTCTG 1860

TCATGGTTTC CATCTGTCCT GAGCAAGTCA TTCCTTTGTT ATTTAGCATT TCGAACATCT 1920

CGGCCATTCA AAGCCCCCAT GTTCTCTGCA CTGTTTGGCC AGGATAACCT CTAGCATCGA 1980

TTCAAAGCAG AGTTTTAACC TGACGGCATG GAATGTATAA ATGAGGGTGG GTCCTTCTGG 2040

AGATACTCTA ATCACTACAT TGCTTTTTGT ATAAAACTAC CCATAAGGCT TTAACCTTTA 2100
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AAGftAWu^TC- .Vu^-rTA GTG77TGGGG GC ZGGGGG AG GACTGACGGG TTCATAAGCC 2160 

AGTAGGTGTG AGG7GAG7AT GT77G.-Ar.-_-. AGG ITTTGAT ATTTGTCAAA AAAAAAAAAA 2220 

AAAAAI'ICC GG C-GGGGGGC-2G GGGAGG7G-G 2249 



(2) INFORMATION FOR SEQ ID NO: 112:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2133 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:

GATACTATAA G7CAAGTGAC 7GAGGGG7GG GGGGTTAGAC TAGTGGATCG CGGGTGCAGG 60 
AJVTTCGGCAiG AG7G-7GG-77G GAG77GAA2T GCTGGCGGCC CCGCGGCCGC TGCCTCCGCG 120 
25 a-JHCCCAA-AA TGATGAAAGT GAG 7GTGAAG AG G CCGAAGA AAAGGAGGAA TTCGCCGTGC 130 

CCGAGAAGA.G CTGGGTCGAG -7AJ_7GT.-A.GG AAGAAATGTC TAAACGTTTT AAATCACATA ■ 240 

CTGACC-A.GT" 7GTGTTGA7A. TT7GGTG7AA AAATTTTGAA AGATCAAGAT ACGTTGAGTC 300 

AGCATGGAA.T TGA7GATG2A GT7A7TGTGG AGGTTGTGAT TAAAAGACAA AACAGGCGTC 360 

AGGATCATTC AGCTCAGtAA. ACA-ATAGAG GTG^TAAGGAA TGTTACTACA TCATGAACTG 420 

35 CTAATAGTAA G7CTA.GA7GT GGT7GTGGTA CTAGCAAGCG TTTTGGTTTA GGTGGCCTTG 48 Q 

GGGGA.CTTGG A7GGGTGAGT .A72T7GGG7T TGAATACTAC CAAGTTCTGT GAACTACAGA 540 

GTGAGATGCA GCGL-iTAACTT 7TG7GTAACG C7GAAATGAT GGTGGAGATG ATGGAAAAWC 600 

CCmTGTTCA GAGGATGG7G I TGAAA.GGGT GACCTGATGI'I AGAGAGTTAA. TTATGGGGAA '660 

TGC\CAAATG CAGGAG77GA TAC-jGAa-AA TGGC^jGAAAT TAGTCATATG TTGAATAATG 720 

45 CAGATATAA.T GAJGA2AAA7G TTG7AA-CTTG CG GAGGAATG CAGCAATGAT GCAGGAGATG 790 

ATGAGGAAG G AGGA7GGAGG TT7GAC<7-AG CTA.GAAAGGA. TGGCAGGGGG ATATAATGGT 340 

TTAAGGGGCA TGTAJ7AGAGA TAT7GA2<2A\ C'GAATGGTGA GTGGTGCACA AGAGGAGTTT 900 

GGTGG7AATC C-.TTTGGT7C CTTGGTGAGG AATAGATGCT CTGGTGAAGG TAGTGAACGT 360 

TCGCGTACAG AAAATAJGAGA TGCACTAGGG AATGCATGGG GTCCACAGAG TTG2GAGAGT 1020. 

55 TCATC-jGGTT CGAGCGGCAC TGG7AGGAGT GTGGGTGGGA CTACTGGTAG TAGTGGGAGT 1080 

GGGACTTGTG G7GAJGAG7AC TAG77-GGGCA AATTTGGTGC GTGGAGTAGG AGGTAGTATG 1140 

TTGAAGACAG GAGGAATG-GA GAG77TGTTG GAACAAATAA CTGAAAACGG ACAACTTATG ' 12Q0 

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CAAAACATGT TGTCTGCCCC CTACATGAGA- AGCATGATGC AGTCACTAAG CCAGAATCCT 
GACCTTGCTG CACAGATGAT GCTGAATAAT CCCCTATTTG CTGGAAATCC TGAGCTTCAA
GAACAAATGA GACAACAGCT CCCAACTTTC CTCCAACAAA TGCAGAATCC TGATACACTA
TCAGCAATGT CAAACCCTAG AGCAATGCAG GCCTTGTTAC AGATTCAGCA GGGTTTACAG
ACATTAGCAA CCGAAGCCCC GGGCCTCATC CCAGGGTTTA CTCCTGGCTT GGGGGCATTA
GGAAGCACTG GAGGCTCTTC GGGAACTAAT GGATCTAACG CCACACCTAG TGAAAACACA 
AGTGCCACAG CAGGAACCAC TGAACCTGGA CATCAGCAGT TTATTCAGCA GATGCTGCAG 
GCTCTTGCTG GAGTAAATCC TCAGCTACAG AATCCAGAAG TCAGATTTCA GCAACAACTG 
GAACAACTCA GTGCAATGGG ATTTTTGAAC CGTGAAGCAA ACTTGCAAGC TCTAATAGCA
ACAGGAGGTG ATATCAATGC AGCTATTGAA AGGTTACTGG GCTCCCAGCC ATCATAGCAG 
CATTTCTGTA TCTKGAAAAA ATGTAATTTA TTTTTGATAA CGGCTCTTAA ACTTTAAAAT
ACCTGCTTTA TTTCATTTTG ACTCTTGGAA TTCTGTGCTG TTATAAACAA ACCCAATATG 
ATGCATTTTA AGGTGGAGTA CAGTAAGATG TGTGGGTTTT TCTGTATTTT TCTTTTCTGG 
AACAGTGGGA ATTAAGGCTA CTGCATGCAT CACTTCTGCA TTTATTGTAA TTTTTTAAAA 
ACATCACCTT TTATAGTTGG GTGACCAGAT TTTGTCCTGC ATCTGTCCAG TTTATTTGCT 
TTTTAAAGAT TAGCCTATGG TAGTAATTTA TGTAGAATAA AAGCATTAAA AAGAAGCAAA 
AAAAAAAAAA AAAAATTCCT GCGCCCGCGA ATTCTTCT 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1043 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:

CTGAAGTGTA TGTGGTGAGG AAGAAGAGGC TCCTACTGTA GACAGCCTTG TTCTACAGAT

CCTCCCAGAA ATCTCTGGGC CAGGTGGAAC CCAGGGTCAG AGAGGGATGG GAGAGAGGTT

TAATTTTCCA TGATAAATAA AAATCTATAA AATAATAAAC AAGAGAAAAG AGATTGGAAA 

CAGCCAGGTT GGAGCAGTGA GTGAGTAAGG AAACCTGGCT GCCCTCTCCA GATTCCCCAG 

GCTCTCAGAG AAGATCAGCA GAAAGTCTGC AAGACCCTAA GAACCATCAG CCCTCAGCTG 

CACCTCCTCC CCTCCAAGGA TGAOAAGGC GCTACTCATC TATTTGGTCA GCAGCTTTCT 

TGCCCTAAAT CAGGCCAGCC TCATCAGTCG CTGTGACTTG CCCCAGGTGC TGCAGCTGGA 
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RGACTTGGAT GGGTTTGAGG GTTACTCCCT GAGTGACTGG CTGTGCCTGG CTTTTGTGGA 430 

AAGCA^GT^C AACATATCAA AGATWAATGA AAATGCAGAT GGAAGCTTTG ACTATGG30T 540' 

CTTCCAGATC AACAGCCACT ACTGGTGCAA CPATTATAAG AGTTACTGGG AAAACCTTTG 600 

GCACGTAGAG TGTCAAGATC TGCTGAATCC CAACCTTCTT GCAGGCATCC ACTGCGCAAA 660

AAGGATTGTG TCCGGAGCAC GGGGGATGAA CAACTGGGTT AGAATGGAAG KTTGCACTGT 720

TCAGGCCGGC CACTCTTCTA CTGGCTGACA GGATGCCGCC TGAGATKAAA CAEGGTGCGG 780

GTGCACCGTG GARTCATTCC AAGACTCCTG TCCTCACTCA RGGATTCTTC ATTTCTTCTT 840 

CGTACTGCCT CCACTTCATG TTATTTTCTT CCCTTCCCAT TTACAACTAA AACTGACCAG 900

AGCCCCAGGA ATAAATGGTT TTCTTGGCTT CCTCCTTACT CCCATCTGGA CCCAGTCCCC 960 

TGGTTCCTGT CTGTTATTTG TAAAOTGAGG ACCACAATAA AGAAATCTTT ATATTTATCG 1020

AAAAAAAAAA AMAMAACT CGA' 1043 



(2) INFORMATION FOR SEQ ID NO: 114:

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 703 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

GAATTCGGCA CGAGTGCGCG GGO^CCACGG CGGTTTTTCG ACGCTGGCGG TGGACGCAGG 60 

CAGCATGGAC CACGGTTGCT GGGCGGATGG GGAGCGTCTA TGGTCAGTTG CCTTAGAAGT 120 

GGTGAGATGG GAAGCTGCAG TTGGAAGACC CTGGAGGATG CCTGACAAGG GGATGTCTGA 180
CACATGATTG GAGCTCTTTT TGAAATGTTT CTTGCCCTTC CTGGAGCAGA GGAGCCATTA 240

TTTATGCAGG TACATCGAAG TCTTTTGACC TCCATACAGT GATTATGCTT GTCATCGCTG 300

GTGGTATCCT GGCGGCCTTG CTCCTGCTGA TAGTTGTCGT GCTCTGTCTT TACTTCAAAA 360 

TACACAACGC GCTAAAAGCT GCAAAGGAAC CTGAAGCTGT GGCTGTAAAA A^TCACAACC 420 

CAGACAAGGT GTGGTGGGCC AAGAACAGCC AGGCCAAAAC CATTGCCACG GAGTCTTGTC 480

CTGCCCTGCA GTGCTGTGAA GGATATAGAA TGTGTGCCAG TTTTGATTCC CTGCCACCTT 540 

GCTGTTGCGA CVTAAATGAG GGCCTCTGAG TTAGGPAAGG TGGCCAOAA AATCTTCATG 600

AGCAATACTT CTTAGTAGAT TGTTTTGTTA TTCAAATCAA GTTCTAGTGT TTTTATGTGA 660 

GATTATATAA TTTACAGTGT TGTTTTATAT ACTTTTGAAT AAA 703

(2) INFORMATION FOR SEQ ID NO: 115:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3684 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115:

GGCAGAGGGG GCATGAGCAG GAGGAGGATT ACCGCTACGA GGTGCTCACG GCCGAGCAGA 60

TTCTACAACA CATGGTGGNA ATGTATCCGG GAGGTCAACG AGGTCATCCA GAATCCAGCA 120

AGTATCACAA GAATACTCCT TAGGCACTTC AATTGGGATA AAGAGAAGCT AATGGAAAGG 180

TACTTTGATG GAAACCTGGA GAAGCTCTTT GCTGAGTGTC ATGTAATTAA TCCAAGTAAA 240 

AAGTCTCGAA CACGCCAGAT GAATACAAGG TCATCAGCAC AGGATATGCC TTGTCAGATC 300

TGCTACTTGA ACTACCCTAA CTCGTATTTC ACTGGCCTTG AATGTGGACA TAAGTTTTGT 360 

ATGCAGTGCT GGAGTGAATA TTTAACTACC AAAATAATGG AAGAAGGCAT GGGTCAGACT 420

ATTTCGTGTC CTGCTCATGG TTGTGATATC TTAGTGGATG ACAACACAGT TATGCGCCTG 480 

ATCACAGATT CAAAAGTTAA ATTAAAGTAT CAGCATTTAA TAACAAATAG CTTTGTAGAG 540 

TGCAATCGAC TGTTAAAGTG GTGTCCTGCC CCAGATTGCC ACCATGTTGT TAAAGTCCAA 600

TATCCTGATG CTAAACCTGT TCGCTGCAAA TGTGGGCGCC AATTTTGCTT TAACTGTGGA 660 

GAAAATTGGC ATGATCCTGT TAAATGTAAG TGGTTAAAGA AATGGATTAA AAAGTGTGAT 720 

GATGACAGTG AAACCTCCAA TTGGATTGCA GCCAACACAA AGGAATGTCC CAAATGCCAT 780

GTCACAATTG AGAAGGATGG TGGTTGTAAT CACATGGTCT GTCGTAACCA GAATTGTAAA 840 

GCAGAGTTTT GCTGGGTGTG TCTTGGCCCA TGGGAACCAC ATGGATCTGC CTGGTACAAC 900

TGTAACCGCT ATAATGAGGA TGATGCAAAG GCAGCAAGAG ATGCACAGGA GCGATCTAGG 960

GCAGCCCTGC AGAGGTACCT GTTCTACTGT AATCGCTATA TGAACCACAT GCAGAGCCTG 1020 

CGCTTTGAGC ACAAACTATA TGCTCAGGTG AAACAGAAAA TGGAGGAGAT GCAGCAGCAC 1080 

AACATGTCCT GGATTGAGGT GCAGTTCCTG AAGAAGGCAG TTGATGTCCT CTGCCAGTGT 1140 

CGTGCCACAC TCATGTACAC TTATGTCTTC GCTTTCTACC TCAAAAAGAA TAACCAGTCC 1200 

ATTATCTTTG AGAATAACCA AGCAGATCTA GAGAATGCCA CAGAGGTGCT CTCGGGCTAC 1260

CTTGAACGAG ATATTTCCCA AGATTCTCTG CAGGATATAA AGCAGAAAGT ACAAGACAAG 1320 

TACAGATACT GTGAGAGTCG ACGAAGGGTT TTGTTACAGC ATGTGCATGA AGGCTATGAA 1380

AAAGATCTGT GGGAGTACAT TGAGGACTGA GAATGGCCCT GCATAAAATG AACTCTGAAA 1440
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ACTTTACCAT CTAGAGTGGT CATGCAATTA AAACAAAACA AACACAAACA AGGAGGCACT 1S0Q 

AAGCCTATTC TGAGACCACT GGTCTGTAGT ACCAGAATTG TTTTGTTAAT GGAAAGTTTA 1560

AGTAAATTAT ATTGTAATAA AAAGGTAGAT AAPtCCATTGT ACAACAGTAT TCTAGGCCGC 1620 

CAACAAAAGT GTGACAGAGA CACTAAAAGG CCTCCAACTT TAACTTGTAA CGTAGCTTCA 1680

TTGTCAAAGG TGACTCGTTT TTTTTCTTTT TCCTTTTCCT GAGTGTAGTA CAGTTAAAAT 1740

TTCAAACAGC TCCTTGACAC TGCTTTTCAT CTTCAAACCA GCCATTTTGT TGTACTTTGG 1800

TAAAGGACCT CTTCCCCTTC CTCCCCTACA CATACAGATA CACCCACACA CAGACTGACT 1860

CTCTTTCTCT CATACCCCAA GGTCATGAGT GAATGATGCT TAGTTCCTTG TAAAGAAAAT 1920

CTTGGGATGG GGAAAGGGGT AGGCAGCAAG AGGATTCAAC AAACGAAAAA CATAAAAACT 1980

TTGTATATGA GTTTTAAAAC AAGAGGACAA CACAGTATTT TTCAAAATTG TATATAGCGC 2040

ATATGCATGG ACAAAGCAAG CGTGGCACGT GTTTGCATAA TGTTTAATTA CAAAAAAATA 2100

TTTATTCTTT AAAAATCTTC AAGATTATGT CTATTTGCTG TGCATTTTCT TTCAGTTTGC 2160

TTATCTTTCC CGGGTTGGGG TTGGGATAAA GGTGTGTCGG TTTAGCACCT CTGGAAGACC 2220

TATCTAGAGC TCTTTCACTT TCCTGAGGTT ATTTTGCCCY TTCTGGTGTT GGTATGTCTG 2280

TTGCCGGCCA TGGGGTNCAY GCCTTGAATT CGTGGTCTTG ATGAGGGACA AGGGAGGTCA 2340 

AGCTCTGACT AATGGCATGA CCTGATTAAG GGGTACAGCA GGGAGTTTTG TTGCTACAGC 2400 

TCATGAATTA ACCTGTCGCA AGCTAATCGC CCTCCATGGG ATCATGCCTC TACCCAAGGG 2460

TTTGTGTGCC CATGTTATGG ACACAGCTGT AGGGATTCTT AAGTCCCCTG TCGCATGCAG 2520 

TGGAAGCATT TTAAAATTTC TTTTACTTTT TGGTTTTCCC TTAATTGCTG CTTTTCAGAT 2580

TTTAGTTATG GGTCGTCTGC TCACCCGTTC TCTACATTAG GGTGTCAAAG AGAATGTTTT 2640

GGTTTAAATA TAAATAGCCA TTCATTTAGT CTCAGATTGT GAATTTAAAA TGGTGGATAC 2700 

CGAAATTGGT TGTGTGTGTT GCTGTGGGTT TGGTTTGAAG GGAAACACCC CTAGAACATG 2760 

ATATTCCCAT CTAGTGCATT TAAATAGAAA TCACTGAGTT TGCTGCTTTT TTATTGTCAG 2820
CAGATAGGAG AATTAATAAT GCATTTTAGC TGTGATGTCC ATTTTTATGA AATTCCTACT 2880

AAGAGCTATG TTAAAAGTAA AGGATGGTGG TGGTTGTATT AACTATATAC CTGTTTAGGC 2940 

CATTCTGGCT GTGGTATTTT TCAATAGGTC AGGATCTGTA AATCTGTCAG TTTTATACAG 3000 

GAGTGCAGAG TGAACTAGGG AACTAGATTA AGAGGTCTAA ATATGAAATA GGAGTTGAGG 3060

CTGAGGACCT CTTCGTCTTC CTTTAAATGT CTTTTGGCTA GGGAGTGTTT ACGATTTGTG 3120 

AGGCAGCTTT GTGTGCTCTT ACACTGTACA TCCTATTACT CCATTGGGAA GTAGGTTCAC 3180

TTTCCTCTGG CCTTTTGCCT AAGTTAGGCT TTGCTGAATC AACGCTACTT TTCCTTTTAG 3240
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AAGGTTGT TACAGGAGAT TTACTGGCAA CTGTTCTTTT CCCATCAAAA ATCAGTGAAT 3300 

GTTTGCTGAG TATAAATGCT GCTTCCTTAA ACCACTTGTC GCTTTAGGAT CAACTTTACC 3360 

5 

TGTACCTTTT CTCCTTTCCT CCCTTGCCAC CTGAGGTGCA AATCTGAACT CAGTGTCTGC 3420 

TTCTTCCATT TTCTCGTCTC TCTCCCCTCT TCCCCCATTA TCCATATGAC ATTATTTTAC 3480 

10 TTCAAATGAC AGCATCAATC TTAAAAAGAT ATACATTAAA ACTAAGGAGT TTTTTTAAAG 3540 

AAAGCCTGAA TAAGTTCCTT TCCCTGGTAA CTTTGAAAAG CAGTCAGAGT TGCTATATAG 3600 

ATATATGTGG CTCCTTTAAA ATGCTTTGTG TATGTGTGGT GTTTAAAAAA AAAAAAAAAA 3660
TTCGGGGGGG GGCCCGGTNC CCAT 3684



(2) INFORMATION FOR SEQ ID NO: 116: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1965 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116:

AAGAAAGGGT ATTAAAATTC TAGATCACAT ATGGACCCGG GAAGGTTTTT NACCCTCTGT 60 

TAGTGACATC GAGTCTCCCA CTAGACAAAA TAGGTGGAAA AATCTCTCGA GGGCTCACAT 120

TGTTTTGTCA TCTTCAGGAA AAACACCACC AGGCCATACC ACAGCCTGCC CAGTGAGGCG 180

GTCTTTGCCA ACAGCACCGG GATGCTGGTG GTGGCCTTTG GGCTGCTGGT GCTCTACATC 240 

CTTCTGGCTT CATCTTGGAA GCGCCCAGAG CCGGGGATCC TGACCGACAG ACAGCCCCTG 300

CTGCATGATG GGGAGTGAAG CAGOVGGAAG GGGCTCCCAA GAGCTCCTGG TGGTGCAGCC 360

TGTGCTCCCC TCAGAAGCTC TGCTCTTCCC AGGGCTCCCG GCTGGTTTCA GCAGGCGACT 420 

TTCTTCCAAT GCTGGGCCCA GACTTCTTGC CTGGGTGCTG GCCTGCCCTC TCCGGNCOGC 480

TTGCTGCCTG TCTGCTTTCC TTGGTGGYTT TGCTGGGTGC TGGGCCTGCC CTCTCCGGCC 540 
GCTTGCTGCC TGTCTGCTTT CCTTGGTGGC TTTGCTGGGT GCTGGGCCTG CCTTCTCTGG 600

CTGCTTGCTG CCTGTCTGCT TTCCTTGGTG GCTTTGGCTT CTGCACTCCT TGGCGTCASC 660

TCTCAGGTCC TCCATTCACA CGAGGTCCTC CTCGCTCTGG CCGCTCTTGC TGCTCCTGTC 720 

TGAAGAWATC AGACTGATTT CCTCTTAAGA CTCCTAGGGA TGTGGTGAAG AGCTGGGACT 780

CAAGTGCAGT CCACGGTGTG AAACATGAGG GARGTGAGGT GTCCGTCCAC TTCCCCCATA 840

AAGGTGTGCA TTTCAGTTAG GCTGCCCCGC CACAGAGCAG GCTTCATCTG CTCTGCCATC 900

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CAGCCCCATC TGGATGTGAG GTGGGGTGGA GACATCATGG GGTGATTGCA GAAAGGGGGA 960 

GTGGCGGCCC ACGCAGCTTC TGCTGAGGAG CTGACCGCTC TGAGCTGTTC TGTTTCGTAT 1020 

TGCTGCTCTG TGTCTGCATG TATTGTGACC GTGCGGCTCC ACCTCTTCCA GCTGCTGCTA 1080 

CAGCTGAGGC CTGGATCCCG GCCTTTCCCT GTGACTTACG TGTCTGTCAC CGGCANGCAG 1140 

CCCTACAAAT CCTGGTGACC TGCTCTCCCA AGAACAGAGC CTGTCCCCAG ATGTCCCAGT 1200 

AGCGATGAGT AACAGAGGTG GCTGTGGACT TCCTCTACTT CTCCTTGCTG GATCAGGGCC 1260

TTCCTGCCTC CCGCTGGGCA GGTCTGGCCT TGCTGTGTTG GCAGGGCCCC AGCCCCTCTG 1320 

ACCACTCTGC AGCTCACCAT GCAGCTGATG CCAAAGTTGT GGTGTCCAGT GTGCAGCAGC 1380

CCTGGGAGCC ACTGCCACCT TCAGAGGGGT TCCTTGCTGA GACCCACATT GCTTCACCTG 1440

GCCCCACCAT GGCTGCTTGC CTGGCCCAAG CTAGCGTTCT GTGCCATGCT AGAGCTTGAG 1500 

CTGTTGCTCT TCTTCAGGGG AGGAAATAGG GTGGAGAGCG GGAAGGGTGT TGCTCCTAAG 1560

TGTTGCTGCT GTGGCTTTTT TGCCTTCTCC AAAGACGCAC TGCCAGGTCC CAAGCTTCAG 1620 

ACTGCTGTGC TTAGTAAGCA AGTGAGAAGC CTGGGGTTTG GAGCCCACCT ACTCTCTGGC 1680

AGCATCAGCA TCCTACTCCT GGCAACATCA GGCCAACGTC CACCCCAGCC TCACATTGCC 1740

AGATGTTGGC AGAAGGGCTA ATATTGACCG TCTTGACTGG CTGGAGCCTT CAAAGCGACT 1800

GGGATGTCCT CCAGGCACCT GGGTCCCATG ACCAGCTCGC CGTCTCCATA GGGGTAGGGA 1860

TTTCACTGGT TTATGAAGCT CGAGTTTCAT TAAATATGTT AAGAATCAAA GCTGTCTTTG 1920 
TTCAGGCTGC TATAACAAAA ATATAATAGC CTGGGTGGCT TAAAG 1965

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:

AGTGATCCCC TTGCCTCGGC CTCCCAAAAT GCTGGAATTG TAAGCGTGGG CCTCTGCACC 60 

CGGCCTGGTC CGCAATTTAA AAACGCACAG CCACCATTCC OTYTCCAGAA AGCACCCAGA 120 

TGCCTTTGGG AGAACCAGCC TCCTCCATGG AGGAAAGCTT GGGATCTGCC TTCCCACCTG 180

GGGAGGAGAG GGATCTGTGG AAAATCCTTC TGACGGACTT CCCCTCAGTG CCTGATCCAT 240 

ACTCAATAGT AGAAAAAGTA AGAAATATAC AAAGATAGCA GATACACGGA GACAGTTCCC 300

CAAATAGCTG AGCGAWTAGC GCAGAAGCAA TATTGAAGAC CTAATAGCTG AGACATTTCC 360
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AGAACTGATA AAGTGCATCC AGC'CACAGAT CL^\CCACCCC AGAAAATTCC AGGCAGCATC 
AACAAATAAA TAGCCCCACA TGCACCCGTG AAAATGCAGA AGAjCCPuAACA AAAAAGTCCG 
GTCAACAGCG AGAGTTAAAG AGG 

(2) INFORMATION FOR SEQ ID NO: 118:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1123 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:
GGCACAGCTT GGAATGAACC CCTGTGGATA AGGGGGACTA TTAGATAGAA TAAACATCAA 
TAAATGCTTG ATGAATAAAC GCTAATCCTA CCTTCCCAGC CTGACACCTC CCAGTGGACA 
CCACACTTCA CTTGAAGCCT TAGAAACCTT TCCCACCCAT GCTTCCAGCC CTGGCTTCAT 
GTTGCCATTT CTCACCCCCA GAACAGGCCG CCCGCCTGAA GAAACTACAA GAGCAAGAGA
AACAACAGAA AGTGGAGTTT CGTAAAAGGA TGGAGAAGGA GGTGTCAGAT TTCATTCAAG 
ACAGTGGGCA GATCAAGAAA AAGTTTCAGC CAATGAACAA GATCGAGAGG AGCATACTAC
ATGATGTGGT GGAAGTGGCT GGCCTGACAT CCTTCTCCTT TGGGGAAGAT GATGACTGTC
GCTATGTCAT GATCTTCAAA AAGGAGTTTG CACCCTCAGA TGAAGAGCTA GACTCTTACC 
GTCGTGGAGA GGAATGGGAC CCCC^AAGG CTGAGGAGAA. GCGGAACOTG AAGGAGCTGG 
CCCAGAGGCA ANGAGGAGGA GGCAGCCCAG CAGGGGCCTG TGGTGGTGAG CCCTGCCAGC 
GACTACAAGG ACAAGTACAG CCACCTCATC GGCAAGGGAG C^GCCAAAGA CGCA3CCCAC 
ATGCTACAGG CCAATAAGAC CTACGGCTGT KTGCCCGTGG CCAATAAGAG GGACACACGC 
TCCATTGAAG AGGCTATGAA TGAGATCAGA GCCAAGAAGC GTCTGCGGCA GAGTGGGGAA 
GAGTTGCCGC CAACCTCCTA GGCGCCCCGC CCAGCTCCCT TTGACCCCTG GGGCAGGGCA 
GGGGGCAGGG AGAGACAAGG CTGCTGCTAT TAGAGCCCAT CCTGGAGCCC CACCTCTGAA
CCACCTCCTA CCAGCTGTCC CTCAGGCTGG GGGAAAACAG GTGTTTGATT TGTCACCGTT
GGAGCTTGGA TATGTGCGTG GCATGTGTGT GTGTGTGTGA GAGTGTGAAT GCACAGGTGG 
GTATTTAATC TGTATTATTC CCCGTTCTTG GAATTTTCTT CCCATGGGGC TGGGGTACTT 
TACATTCAAT AAATACTGTT TAACCCAAAA AAAAAAAAAA AAAAGAAAGA AGN

(2) INFORMATION FOR SEQ ID NO: 119:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:

GGGCACAGCT GAAGCTGCAG ACCTCCCCAG GGGATGGCTC CTCTCCCCCA GGAGCCCCGA 60 

GGCAGGGGAG GCAGAAAGCC TGGGCTCTGG GGGGTGGCCT GCGGACAGCT GTGCTGTGGG 120

CCGGGCGCTG GGCCTGTCCC ACAGGGNCGT GGAGCTCGTG GTTCTGAGCA GCCAGCTGGG 180 

TGGTGTCTGG GGATAGCTGG GAGGCACAGC GGCTGCCATG TGGGACTGGG ACTGGAGTGC 240 

TCCCTGGTCT TGGCCTCTGT GGCTCAGCCT TGCTCTGGTC TGCCTGAGTG CAGGGGCCAA 300 

GGGGCACAGG GCCAGTGAGG CCGGCCACGC TCGGGCCCTC ACCTGTGAGA TGGGGTCGGA 360

ATTTKACACA GCCTANGGCT TGGTTCTTGG TKGTNGAMCG TGGACTYCTK AGAAjCGGGAG 420 

TGCTGGTCCT GAAAGGCGTG GTTGGAGACC AGCTGCTTTT CTCGCTGTTT TTCTCTTAGG 480

AGATTAAACA AAAACAGAAA GCACAAGACG AACTCAGTAG CAGACCCCAG ACTCTCCCCT 540 

TGCCAGACGT GGTTCCAGAC GGGGAGACGC ACCTCGTCCA GAACGGGATT C^GCTGCTCA 600 

ACGGGCATGC GCCGGGGGCC GTCCCAAACC TCGCAGGGCT CCAGCAGGCC AACCGGCACC 660

ACGGACTCCT GGGTGGCGCC CTGGCGAACT TGTTTGTGAT AGTTGGGTTT GCAGCCTTTG 720

CTTACACGGT CAAGTACGTG CTGAGGAGCA TCGCGCAGGA GTGAGGCCCA GGCGCCGAGA 780

CCCAAGGCGC CACTGAGGGC ACCGCGCACC AGAGCGTGAC CTCGGCAGGC TGGACACACT 840 

GCCCAGCACA GGCAGACCCA CCAGGCTCCT AGGTTTAGCT TTTAAAAACC TGAAAGGGGA 900 

AGCAAAAACC AAAATGTGTG ACTGGGCTTT GGAGGAGACT GGAGCCTCAG CCCTGTCCTG 960
GCCACGGGCC GCTGGGGCTG GTGTGGGTGG GCCTTGTGTG CTGGATTTGT AGCTTATCTT 1020

CCGTGTTGTC TTTGGACCTG TTTTAGTAAA CCCGTTTTTC ATTTTAAAAA AAAAAAAAAA 1080

AAACTTTGGG GGGGGGCCCC N 1101 

(2) INFORMATION FOR SEQ ID NO: 120: 
(i) SEQUENCE CHARACTERISTICS:



(A) LENGTH: 232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:

AGCTTCTCTG TCCAGTCTTG AACTCTGGGS TCTCTTGGAA CTTTCCTCAC CCCTCTCAGC 60 

CTGAATATTC CTTCCATGGA TTCCACTCAA CCAGACTTTG GATCTGTGCC TACTTAATCA 120

ACCTTATCTT TGCAATATGT TCGGGCCCAC CTTCCACTCC TTGGTTCTTG TTCCTCCTTG 180

GCCTAACTTG TCCCTTCTCC ACITCACATC CCCGGTGGGA CAGCATTCCT CCTTCCTCCC 240

AACCTCCCTC CGTCTCAPAA AAAAAAAAAA AAAAAAAAAA TT 232 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2535 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:

TAAGGGGGTG TGTGCTCACC TCCTCCTGAC t< CCTTAACACT CCTGTCCTGC CCAGACCAAC 60 
AGAGAGAGCT GTCCCTGAGA CCCCGGAGAG AAGCAGCTGC CC-AAA.GCTGC AGCCTTTCCG 120 
CACTCTGAGA CCATGATCTT CCTCCTGCCA GGGGAGAGCC ACCCACAGGC CATGTCCAGC 180



CCCACTTCCC TCAGCCCCCA GGGYTTCCTT CTGGCCCCTC TGAGGATTCC CTAGGGCTGC 240



CCCGCAGAGG GGYTTCCCCA AGCTCTGTTT TGAAGCCTCC AATGTGGAAA AGTGAGAAGT 300

35 

CAGAGGGAAC AGGACAGGTG CAGCCGGGCT CTGAGGCCAC ACCTCACACC TCGCTGTTCC 360 



CCAACATCCC CTGAGCAGTG TGAGCTCATC TCACCAGATG AGAAGAGGCC CTGTGCATTT 420
YTTTTGTTTG TTTGTTGCTG TTTTCCCCCA CCCATCCAGT TCTCCTCAGC AAAGCAAATT 480



CCTTAACACC TTTGGTGGAG AATTTCTTAC CCAGACTTGG GGCTGTGATG CCCTTCAGTG 540 



CGTGGTGAGT GCACCGTGTG TGCGTGTGCC TGTGTGTGAA CCTGGGGGCC ATCCTGGTGG 600

45 

CCTGGGAGCG TGAGGAGAGG CCCCCTGTGT GCTGGGTGAG TGGTGGGTGT GGGGTCAATG 660 



C^GTGAGGCT CTCTGGGTGA GGCTCCCAAC CTGGCAGTCC CCAGCCTCCC AGCATCTGTG . 720 
AGCGTCTGTT GGACTTTACA GAAGAGCCTC ATCCYGTCTG CCCCTCACTC TGCCCTGGAA 780



TCAACATCTT CCGAGTCCTT CTTGGGGGAA ATAGCAGAGC CCCACTTAAC TCCATAAACT 840 



GCTTCOCATT CCGCAGCCCA GTTCTGATTG TTGAGGTGTC GCGTCGTTCC AGGTCCCCCA 900 

55 

GTCCCCTCTT TCTCCTGTCC TCTCTCTGTC CTTCACCTCC CCACTCCAGC CCCGGCTCAG 960 



TTCAGGGAAA TGCTGTTCCA YATCAGCCCT CTGCTCTCTG ?JZGCXCCCQC GCCTCTGACT 1020 
CGGAGCTACT TGAAACTTCT GCTCTTGCTA GGATTGGAGT CTACCTATCT CTTCCATTTG 1080
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TCCCAGCTGG AGTTCTGGAA CTTTCCTCCT CGGGGTG-'SGG GTGGGGGTGG T7AAGGATGC 1140 

TGGGGGGCCT GGGGAAGGAA GGAGTTCAGA GGAAGGGGGT CCCGTGGCCT CTrGAGGTCA 1 ?0Q 

CCCTCCGCTC CTGGGACACG TGCTCTCTCT GTCGCTC-GGT CTTCTGCCGG TC-tl-jCGTrTG 1250 

TGTGTCCTTG TAAATATGTT TTAGGAAGAA AGCAAAAGGG ACTGAACGAG CCGrGGGTAG U2Q 

10 GATTGCAGGG GTCCAGCCTT GCCTGTTTCC GAAGCCCCCA CACGGC TTTT CGCGCCACGG U 3 Q 

AGACTGGTCC CCTCAAAAGG TAGACAAAAC AGCAGCTCCC TGTGGAGGGG AAGGGCGGCC 1440 

TCAAAGTGGG TTTTTGTTAG ACAAGGTTAA GGTTGCGGGA. TGAC-GAAGGT TGCAGA.TCGG 1500 

TCCTTCCTCA GCTOGTTGAT TTGTGAGCTT GACCAAGGGG CCTGCCACCC AC-CrCCTCC-. 1550 

GTGCGCTCTC CTCGATGCCT CGCTCCTTCC TGCCCCCACT CCCCTGC-m AGGCAGGTAG 1520 

20 GGGAATTAGG GCGATGCTGG AAGAAGCTTA ACCATGTGTT CAAAGAA2GG TT7GTTGCTT 1530 

GCTTGGTCCT GGAACTCCCC TTGGCTGCGC CAGGCCTCC7 TGGGGGATGG GGGTTGGGGG 1740 

AGGTGGATGT CAGATCTGGT AGGTTGCAGG AGAGAAAATA AATG7GCC7T GAGAGA.CCA.C 1300 

TGAGAGAGGG TCCAAGGGTG ATGGAGAAGG AAGCATGGCC TGG3AGGTTG GAAGGGARGG 13 6 Q 

GTGGTGGGTG GCGGCATCTT GACTGCCCCC GGTIG7CCCA CACGTGGGGG GTC-G-CACCG 1920 

30 CrCTTCACTG CAGCCCGCCT GCCTTCAGCC GTCCAGGAGC TTCACCGGC7. TCCAAGTTCA 1930 
CTTTGGAGGG GGTGGGGTGG GTTGGCATCA ACACGGGGA.C CCTCTC-7TGC AC GAAAGCCC ■ 2040 

GAGCCCTCAG CCCCTGGGGA GAACAAA7GG CTGAGCTTTG ATAC CGGGGG TGGGGGAGAG 2100 

GCTGCGGGCT GGGGGGAGTC CGAGGGGAGA GACACCAGAG AAGGAGAC GC AGAGAGCCCG 2150 

AGGAAGTTGG CAGCAGAGCA AACTGGTTTG CAGCC7GAAG CC7GCTGAAA CGGGGTGATG 2220 

40 TGCAATAACT GAGCTTAGAG TTAGGAATTG 7GTTCAAGTG CTTGGATTTG - C GGC7GTAGA 2230 

TTTAACTGGT GAAATTGTAT CTCTCAOTAA TTTGAGA.GGT CTTTTA.-AAA ACGGAAAAAC 2340 

AAAGTGTTAG ACTGTGTGCG TGTGCGTTGA TGGGGACTCA AGAGTCCCGT GAGGCATCCA 24Q0 

GCCCTGCCTT TCCCCTGCGC CCCCATCCTC TCACGTCCCG CCCiGCGGCG ACGGGGGGAC 2450 
CCTGCCTCGT GTCGTCTTTA TCTGCCTATT AjCTCAGCGTA. AGGAAA_T-_AG TA.CACTCCAC .-. 2520 

50 ACATGCATAA AGGAAATCAA ATGTTATTTT TAAGAAAATG GAAAATAAAA ACGTTATAAA 2530 

CACCAAAAAA AAAAAAAAAA. ACG-GMGGG3G GGGGCCGGTA ACCCAT7TCG CCGAA 2535 

(2) INFORMATION FOR SEQ ID NO: 122:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 994 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:

GAATTCGGCA GAGGTTCGGC GAAGATAGGG AATAAGGAAG C*iGAGGAGTA CGGGAGAAGG 60 

AAGCACAGGA GTAGGGGAGA TATACAGCGG TCAGGATAAG GGGGAAAGGG CGGTGGTTGC 120 

SCAAGAGGTG AAACAAGATG TGAGAGACAA GGGGTAGGGA AGAAATGGGG CAGCGGTTAG 180

GTTCAGAAGC GCATAG AC CG TGGCC-GACG-G GC AATGCGAG GGGCACAGAA AGGAACTGAG 240 

GGGTGGGCTA TTTTAARGGA GATGGTCCTT CAGCCCTCTT YTTTTCTGCG TAGTTCTCCT 300 

CCTCCAGGCC GCGCGCGGAT ATGTCGTCCG GAAACCAGCC CAGTCTAGGC TGGATGATGA 360

CCCACCTCCT TCTACGCTGC TCAAAGACTA CCAGAATGTC CCTGGAATTG ACAAGGTTGA 420 

TGATGTCGTG AAAAGACTCT TGTCTTTGGA AATGGCCAAC AAGAAGGAGA TGCTAAAAAT 480

CAAGCAAGAA CAGTTTATGA AGAAGATTGT TGCAAACCCA GAGGACACCA GATCCCTGGA 540

GGCTCGAATT ATTGCCTTGT CTGTCAAGAT CCGCAGTTAT GAAGAACACT TGGAGAAACA 600
TCGAAAGGAC AAAGCCCACA AAGGCTATCp' GCTAATGAGC ATTGACCAGA GGAAAAAGAT - 660 
GCTCAAAAAC CTCCGTAACA CCAACTATGA TGTCTTTGAG AAGATATGCT GGGGGCTGGG 720

AATTGAGTAC ACCTTCCCCC CTCTGTATTA CCGAAGAGCC CACCGCCGAT TCGTGACCAA 780 

GAAGGCTCTG TGCATTCGGG TTTTCCAGGA GACTCAAAAG CTGAAGAAGC GAAGAAGAGC 840

CTTAAAGGCT GCAGCAGCAG CCCAAAAACA AGCAAAGCGG AGGAACCCAG ACAGCCCTGC 900 

CAAAGCCATA CCAAAGACAC TC=AAGACAG CCAATAAATT CTGTTCAATC ATTTAAAAAA 960 

AAAAAAAAAA AAAAAAAAAA AAAAAGGGGA GGOG 994 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1542 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:



GGCASAGCCA CCTCGGCCCC GGGCTCCGAA GCGGCTCGGG GGCGCCCTTT CGGTCAACAT 60

CGTAGTCCAC CCCCTCCCCA TCCCCAGCCC COGGGGATTC AGGCTGGCCA GCGCCCAGCC 120

AGGGAGCCGG CCGGGAAGCG CGATGGGGGC CCCAGCCGCC TCGCTCCTGC TCCTGCTCCT 180

GCTGTTCGCC TGCTGCTGGG CGCCCGGCGG GGCCAACCTC TCCCAGGACG ACAGCCAGCC 240 
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CTGCACATCT GATGAAACAG TGGTGGCTGG TGGCACCGTG GTGCTCAAGT GCCAAGTGAA 300 

AGATCACGAG gactcatcgg tgcaatggtc TTAACCCTGC TCAGCAGACT CTCTACTTTG 360 

GGGAjGAAGAG agcccttcga gataatcgaa ttcagctc-gt tamctctacg GCCCACGAGC 420 

TCAGCATCAG CATCAGCAAT GTGGCCCTGG CAGACGAGGG CGAGTACACC TGCTCAATCT 480
TCACTATGCC TGTGCGAAGT GCCAAGTCCC TCGTCACTGT GCTAGGAATT CCACAGAAGC 540

CCATCATCAC TGGTTATAAA TCTTCATTAC GGGAAAAAGA CACAGCCACC CTAAACTGTC 600 

AGTCTTCTGG GAGCAAGCCT GCAGCCCGGC TCACCTGGAG AAAGGGTGAC CAAGAACTCC 660 

ACGGAGAACC AACCCGCATA CAGGAAGATC CCAATGGTAA AACCTTCACT GTCAGCAGCT 720

CGGTGACATT CCAGGTTACC CGGGAGGATG ATGGGGCGAG CATCGTGTGC TCTGTGAACC 780

ATGAATCTCT AAAGGGAGCT GACAGATCCA CCTCTCAACG CATTGAAGTT TTATACACAC 840

CAACTGCGAT GATTAGGCCA GACCCTCCCC ATCCTCGTGA GGGCCAGAAG CTGTTGCTAC 900

ACTGTGAGGG TCGCGGCAAT CCAGTCCCCC AGCAGTACCT ATGGGAGAAG GAGGGCAGTG 960 

TGCCACCCCT GAAGATGACC CAGGAGAGTQ CCCTGATCTT CCCTTTCCTC AAGAAGAGTG 1020 

ACAGTGGCAC CTACGGCTGC ACAGCCACCA GCAACATGGG CAGCTACAAG GCCTACTACA 1080

CCCTCAATGT TAATGACCCC AGTCCGCTGC CCTCCTCCTC CAGCACCTAC CACGCCATCA 1140

TGGGTGGGAT GGTGGCTTTC ATTGTCTTGC TGCTGCTCAT CATGCTCATC TTCCTTGGCC 1200 

ACTACTTGAT CCGGCACAAA GGAACCTACC TGACACATGA GGCAAAAGGC TCCQACGATG 1260

CTCGAGACGC GGACACGGCC ATGATCAATG CAGAAGGCGG GCAGTCAGGA GGGGACGACA 1320

AGAAGGAATA TTTCATCTAG AGGCGCCTGC CCACTTCCTG GGCGCGGCAG GGCCCTGTGG 1380

GGACTTGCTG GGGCCGTCAC CAACGCGGAC TTGTAGAGAG CAACCGCAGG GGCCGSCGCT 1440

CCCGNTTGTT CCCCAGGCCA CCCACCCGCT TGTTACAGAA TGTYTKGTTT GGGGTGCGGT 1500 

TTTGTWATTG GTTTNGGATN GGGGAAGGGA GGGANGGCGG GG 1542 



(2) INFORMATION FOR SEQ ID NO: 124:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1390 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

CAAGCTCTAA TACGACTCAC TATAGGGAAA GCTGGTACGC CTGCAGGTAC CGGTCCGGAA 60 

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TTCCCGGGTC CACCCACGCG TCGGGGCCTC AGGGTGGACG CATGGTTCTG CACTGAGGCC 120 

CTCGTCATGG TGGCGCCTGT GTGGTACTTG GTAGCGGCGG CTCTGCTAGT CGGCTTTATC 180
CTCTTCCTGA CTCGCAGCCG GGGCCGGGCG GCATCAGCCG COCAAGAGCC ACTGCACAAT 240

GAGGA.GCTGG OGGAGCAGG CCGGGTGGCC CAGCCTGGGC CCCTGGAGCC TGAGGAGCCG 300 

AGAGCTGGAG GCA.GGCCTCG GCGCCGGAGG GACCTGGGCA GCCGCCTACA GGCCCAGCGT 360 

CGAGCCCAGC GGGTGGCCTG GGCAGAAGCA GATGAGAACG AGGAGGAAGC TGTCATCCTA 420 

GCCCAGGAGG AGGAAGG7GT CGAGAA.GCCA GCGGAAAYTC ACCTGTCGGG GAAAATTGGA 480 

15 GCTAAGAAAC TGCGGAANNT GGAjGGAGAAA CAAGCGOGAA AGGCCCAGCK TGAGGCAGAG 540 

GAGGCTGAAC GTGARGWGCG GAAACGACTC GAGTCCCAGC GCGAATGMT GGAAGAAGGA 600 

GGAGGAGCGG CTTCGCCTGG AGGAGGAGCA GA.AGGAGGAC GAGGAGAGGA AGGCCCGCGA 660 

GGAGCAGGCC CAGCGGGAGC ATGAGGAGTA CCTGAAACTG AAGGAGGCCT TTGTGGTGGA 720 

GGAGGAAGGC GTAGGAGAGA CCATGACTGA GGAACAGTCC CAGAGCTTCC TGACAGAGTT 780

CATCAACTAC ATCAAGCAGT CCAAGGTTGT GCTCTTGGAA GACCTGGCTT CCCAGGTGGG 840

CCTACGCACT CAGGACACCA TAAATCGCAT CCAGGACCTG CTGGCTGAGG GGACTATAAC 900

AGGTGTGATT GACGACCGGG GCAAGTTCAT CTACATAACC CCftGAGGAAC TGGCCGCCGT 960

GGCCAACTTC ATCCGACAGC GCGGCCGGGT GTCCATCGCC GAGCTTGCCC AAGCCAGCAA 1020

CTCCCTCATC GCCTGGGGCC GGGAGTCCCC TGCCCAAGCC CCAGCCTGAC CCCAGTCCTT 1080

CCCTCTTGGA CTCAGAGTTG GTGTGGCCTA CCTGGCTATA CATCTTCATC CCTCCCCACC 1140

ATCCTGGGGA AGTGATGGTG TGGQCAGGCA GTTATAGATT AAAGGCCTGT GAGTACTGCT 1200

GAGCTTGGTG TGGCTTGGTG TGGCAGAAGG CCTGGCCTAG GATCCTAGAT AAGCAGGTGA 1260 

AATTTAGOCT TCAGAATATA TCCGAGAGGT GGGGAGGGTC CCTTGGAAGC TGGTGAAGTC 1320 

CTGTTCTTAT TATGAATCCA TTCATTCAAG AAAATAGCCT GTTGCAAAAA AAAAAAAAAA 1380

AAAAACTCGA 1390



(2) INFORMATION FOR SEQ ID NO: 125:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1233 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:





C-GCGGGCGGG TGAAAGGCGC ATTGATGCAG CCTGCGGCGG CCTOGGAGCG CGGIGGA^CA 60

GACGCTGACC ACGTTCCTCT CCTCGGTCTC CTCCGCCTCC AGCTCCGCGC TGCCCGGCAG 120

CCGGGAGCCA TGCGACCCCA GGGCCCCGCC GCCTCCCCGC AGCGGCTCCG CGGCCTCCTG 180

CTGCTCCTGC TGCTGCAGCT GCCCGCGCCG TCGAGCGCCT CTGAGATGGG CAAGGGGAAG 240

CAAAAGGCGC ATCCGGCAGA GGGAGGTGGT GGACCTGTAT AATGGAATGT GGTTACAAGG 300

GCCAGCAGGA GTGCCTGGTC GAGACGGGAG GCCTGGGGCC AATGGCATTC CGGGTACACC 360

TGGGATGGGA GGTCGGGATG GATTCAAAGG AGAAAAGGGG GAATGTCTGA GGGAAAGCTT 420

TGAGGAGTCC TGGACACGGA ACTACAAGCA GTGTTCATGG AGTTCATTGA ATTATGGCAT 480

AGATGTTGGG AAAATTGCGG AGTGTACATT TACAAAGATG CGTTCAAATA GTGCTCTAAG 540

AGTTTTGTTC AGTGGCTCAC TTCGGCTAAA ATGCAGAAAT GCATGCTGTC AGCGTTGGTA 600

TTTCACATTC AATGGAGGTG AATGTTCAGG ACCTCTTCCG ATTGAAGCTA TAATTTATTT 660

GGACCAAGGA AGGGCTGAAA TGAATTCAAC AATTAATATT GATCGCACTT CTTCTGTGGA 720

AGGACTTTGT GAAGGAATTG GTGCTGGATT AGTGGATGTT GCTATCTGGG TTGGCACTTG 780

TTCAGATTAC CCAAAAGGAG ATGGTTCTAC 1 <TGG ATGGAAT TCAGTTTCTC GCATCATTAT 840

TGAAGAACTA CCAAAATAAA TGCTTTAATT TTCATTTGCT ACCTCTTTTT TTATTATGCC 900

TTGGAATGGT TCACTTAAAT GACATTTTAA ATAAGTTTAT GTATACATCT GAATGAAAAG 960

CAAAGGTAAA TATGTTTACA GACCAAAGTG TGATTTCACA TGTTTTTAAA TCTAGCATTA 1020

TTCATTTTGC TTCAATCAAA AGTGGTTTCA ATATTTTTTT TAGTTGGTTA GAATACTTTC 1080

TTCATAGTCA CATTCTCTCA ACCTATAATT TGGGAATATT GTTGTGGTCT 1140

CTGTTAGTAT AGCATTTTTA AAAAAATATA AAAGCTACCA ATCTTTGTAG AATTTGTAAA 1200

TGTTAAGAAT TTTTTTTATA TCTGTTAAAT AAAAATTATT TCGMACAAGG TTAAAAAAAA 1260

AAAAAAAAAA AAAAAAAAAA AAAAANAA 1233



(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1517 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:
AGTGGCTTAA AGGCATCGTT TTAGGGATTA CTGGGAAGTA TCTTCAAAGT AATACATGAG 60 
AAACATTCCT TCCTAAATCC TTTATTATAT TGAATATCGT ATTAATTGGT TTTCAGAGGT 120 
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T AAATIA ACC ATGTATTCCT GC\ATAAATG TCACTTGTNT CTTGTATATA ATCTTTTTTA 
TATATTACCG GATTGATTCA TTAGTATTTT GTTGAGGATT TTTGTGTCTA TATTCATAAG 
AGATGCTGGT CTGCAGTTTT CTTTTTTTGT GATAATCTGG TTTTTGTATC AGTAATACAG 
GCCCCATGAA ACGAGTTGGG AAGTGTTCAC CTCTCTTGTA TTTTTTCAAG AGTTTGTGAA 
GAATTGCTAT TAATTGTTTA AATGTTTGGT AGAATCTACC ATTGAAATCA TGTGTCCTGG 
GCTTTTTTTT GAGGGAAGTG TTCTGATAAC TAATTGAGTA TCTACTTTTT ATAGCTCTGT 
TCAGATTTTG GTTCTTCCTG AGTTAGTTTT GGTAATTTGT GTATCTCTAG GARTTTGTCC 
ATTTCATTTA TCTGATTTGT TGGCATAAAT TAAACTAAAT TTGGCCTGAG CCTACCTGTA 
TATCTTGAGT CCCTCTGTAA GGAACTGTAG CCTAACTTGT ACATAAACAA ACTGAAATCC 
TAAATTAGGA ATGTAGTTTT TGTAACAGCT CCTGAGTCTC AGGCAGTGAC AGCAGYCAAG 
TCTGTCAATT GCAGGCTGCT AACTAAGCAG CCGATG5TGA AATGAGGCAA AAACCTTTGC 
TTTTAACACA TAGTATAGCT TTGTAATCCT TTTCTTGCAC ACTCGGGTAr. TTTCTTCCTT 
TTTCATTCCC KGWATTTTCC AKGAATATGA RTCTYCCTTT TTTGGGCTCC TGTCAGTCTA 
GCTAATGGTT TGTCAATTTT GTTGATCTT'I 1 / TGAARAACAA AC CTTTGGTT CCACTTTCTT 
GTTGCATATG CTCARTATTC TCATAATTGG AGTGGAAAGC TGATCTTTGA TTACTTATTT 
TACTTAGGGC TGAGGAGTTC ATGGACTTCG O^AAACCTCC TTGAATCTAA ATTGCATCTT 
CTTTCCTGGT TTCTGGGCTG AAACATGTTT TTTCCCATCT WAMAWACCCT TGGTCTTTTC 
ATKGGCGATT AAGACTAGAG AAAGTTCTAG ATMCCTTGTC CTTTTATGCT GTCATTTTGT 
TTAAAGGCTT TCTATGTAGT AAAACTATCT ATATAGACAA AATAGAGCCT TGAGTTGTGG 
TCTTGAATTT GATCAACATG ATTTAGCACA TTCTGTACTG GATATTTCTT CACCTGCTGC 
TACTGTAAAC CATTTTATTC TTGGATCTTC TGTAGAGTAT ATTATCACAG GTACTTTTTA 
CAGGGGTGTC TAATCTTTTG GCTTCCCTGG GCACATTGAA AGAAGAAGAA TTGTCTTGGG 
CCACACATCA AATACGCTAA CACTAATAAT AGTTGATGAG CTAAAAAAAA AAAAAAAAAG 
GCAAAAAAGN CCGAAAA

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1073 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:
TGAATCTATT CTTTGAACAT TCTACAACAA GAATTAO.TT ATACTGTTAT ACCAGAGTAC 60

TTCTGCAGTG TGAAATAGAT TGGTTTGGAA AATGAACCTG GCTTTGCTAT AAATTACATT 120

CACAGGCCTT TTTGCAAATG TGTAACTTGC CTATCAAAGT AGTTTGTAGG GCAAATGCAG 180

AATATATGTC TCCATCTGGT AAAGTACCTT WTAYTCATGT GGGAAATCAA GTAGTATCAG 240 

AACTTGG7CC AATAGTCCAA TTTGTTAAAG CCAAGGGCCA TTCTCTTAGT GATGGGCTGG 300 

AGGAAGTCCA AAAAGCAGAA ATGAAAGCTT ACATGGAATT AGTCAACAAT ATGCTGTTGA 360 

CTOGAGAGCT GTATCTTCAG TGGTGTGATG AAGCTACAGT AGGGRMGATC ACTCATGMTA 420

GGTATGGWTC TCCTTACCCT TGGCCTCTGW WTCATATTTT GGCCTATCAA AAACAGTGGG 480

AAGTCAAACG TAAGNTGAAA GCTATTGGAT GGGGAAAGAA GACTCTGGAC CAGGTCTTAG 540 

AGGATGTAGA CCAGTGCTGT CAAGCTCTCT CTCAAAGACT GGGAACACAA CCGTATTTCT 600

TCAATAAGCA GCCTACTGAA CTTGACGCAC TGGTATTTGG CCATCTATAC ACCATTCTTA 660 

CCACACAATT GACAAATGAT GAACTTTCTG AGAAGGTGAA AAACTATAGC AACCTCCTTG 720 

CTTTCTGTAG GAGAATTGAA CAGCACTATT TTGAAGATCG TGGTAAAGGC AGGCTGTCAT 780

AGAGTTATGT GTTAGTCTCA GGAGTCTTAA CTTTTGAAAT ATGTTTTACT TGAATGTTAC 840

ATTAGATATT GGTGTCAGAA TTTTAAAACC AAATTACTGC TTTTTGAAAC CTCAAATTAT 900 

ATAATGTATC TTATGTATGT GCTTTATATT GTTATTTGTG TATAGATTAA AATAATTCTG 960 

AATTATTTAA TCTGATATGT TGTATTCTGT ATCTTGAAAT TTTTGTTTCC TTGAAACATG 1020 

CATGCATTTA AAAATAAAGC TTAAACAACT GTAAAAAAAA AAAAAAAAAA CTC 1073



(2) INFORMATION FOR SEQ ID NO: 128:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 300 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:



CAACCCCTGC CTTTTTTTTG TTTTCCATTT GCTTGGTAGA TCTTCCTCCA TCCCTTTATT 60 

TTGAGCCTAT GTGTGTCTCT GCCCGTGAGA TGAGTCTCCT GAATACAGCA CACTTACTGG 120 

TCTTGACTCT GTATCCAATT TGCCAGTCTG TGTCTTTCAT TTGGAGCATT TAGCCCATTT 180

ACATTTAAGG TKAATATTGT TATGTGTGAA TTTRATCYTR TCATTATGWT GTTAGCTGGT 240

TATTTTGCTT GTTAGTTGAT GCAGTTTCTT CCMGGCATCA ATGGTCTTTA CAAOTTGGCA 300 
(2) INFORMATION FOR SEQ ID NO: 129:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1275 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:

GGCAGAGCCT GTCCCTGCTG CCCCTGCAAA AAAAACCCCC TCTGGTGTGA GCAGGATGGT 60 

TGGAGGTTAT GTGAGCTCCT TCTCCTTTCC TCCAGTTTCC TCTTCCCTTC TCCTCCCTGC 120

CTCTTTTGCT TTTCCCTTTC TTCCTGGTAC CCCCTGCCCA TTCCTGTATT TTCTCCCATC 180

GCCATTCTCC CCTCTCOCAC TGTCCCTAAC CCGTTCAAAC TCTTTCCTCT TAAATGGTTG 240

AGATTTTCTC TCACCAAGCA CACCCCAGTA TTAATTAAAC TAGCTGCAAA CAGGCAGCAA 300

GTGGTCTACC ATGACAGATG GGTTTTGTGT GTGTGTGTGT GTGTGTAATT GTAATAAAAC 360

ATATTGARTC ACTCAATAAA C^CAGAGTGX CTACTACATG TATCARGCAC TATCATAGAT 420 

GCTAATTAAC GAAACTGAAA TGGCCAGGCC CTCACAGTGG CTCATGCCTA TAA.TCCC.2iGC 480

ACTTTGGGAG GATGAGGCAG GAGGATCACT TGAGGCCGGG AGTTCAAGAC C^GCCTGGGC 540

AACATAGTAA GACTCCATCT CTACAAAAAA AAAATTTTTT TTATTATACT TTAAGTTTTG 600

GGTTACATGT GCAGAACGTG TAGTTTTGTT ACATAGGTAT ATACGTGCCC TGGTAGTTTG 660 

CTGCACOCAT CAACCCATCA CCTACATTAG GTATTTCTCC TAATGTTACC CCTCTCCTAG 720

CCCCCCACCC CGTGACAjGGC CCTGGTGTGT GATGTTCCCC TCCCTGTGTC CATGTGTTCT 780 

CATTGGTCAA CTCTCACCTA TGGAGTGAGA ACATGTGGTA TTTGGTTTTC TGATCTTGTG 840

ATAGCTTGCT GAGAATGTKG GTTTCCAGCT TTATCCACGT CCCTGCAAAjG GGCATAAACT 900 

CATCCCTTTT TATGGCTGCA TAGTGTTCCA TGGTGTATAC GTGCCACATT TTCTTAATCT 960 

ATCATTGATG GAC^AGTTTT GCTATTGTGA ATAGTGCCAC AATAAACATA CGTGTGCGTG 1020

TGTCTTTATA GCAGCATGAT TTATAATCCT TTGGGTATAT ACCCAGTAAT GGGATCACTG 1080 

AGTCAAATGG TATTTCTCGT TCTAGATCCG TAAGGAATTG GCACACTGTC TTCCACAATG 1140

TTTGAACTAA TNTACACTCC CACCAACAGT GTAAAAGTGT TTCTATTTTT CCACAACCTC 1200

TCCAACATCT GTTATTTCCT GACTTTTTAA TGAACGTCAT TCTAACTGGC GTGAGATGGT 1260 


ATCTCATTGT GGTTT 1275 



(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 472 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:



CNGAAACCCC GTGAACCCTC CCCGGGTTAA AAAGCCCCCC CTAAATGGGG GGAACGCYTC 60 

ACACGTTATA AAAAAGCACT AGAATGTTTT GAAAGCGAGA AACAACAGCT GTGTAGGGTA 120 

GCTAGCAGTT AGTGTTGTAC AGAAGACAGA TATTTGTGCA TTTYTGCATT TTCTAAGTTT 180

GCTGCAATGA GCATGTATTA CTTTCATAGT TATAAAACAC ATGCAAAATC CCCTTTTAAA 240

ATGAAAAAAA ATCCATGAJGT GTAAGTGATA TATATGCTTT GGAAAGCCTG GGACGGTCAT 300 

TGTTTACTCT CAATAGTATG TGTTTGCCTT TGTCTTTTTG AGACATTTTG TTTTAATCTG 360 
TTGATGACAA TAACCTGTTG ATAATATAAC TTGATAACAA ATAAAATGAC TTATGATTGA 420

AWMAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA NN 472 



(2) INFORMATION FOR SEQ ID NO: 131:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1950 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:



ACCTCTCAGA ATCTTCTCTC AGCAACCTGA GTCTTCGCCG TTCCTCAGAG CGCCTCAGTG 60

ACACCCCTGG ATCCTTCCAG TCACCTTCCC TGGAAATTCT GCTGTCCAGC TGCTCCCTGT 120

GCCGTGCCTG TNATTCGCTG GTGTATGATG AGGAAATCAT GGCTGGCTGG GCACCTGATG 180

ACTCTAACCT CAACACAACC TGCCCCTTCT GCGCCTGCCC CTTTNTGCCC CTGCTCAGTG 240 
TCCAGACCNT TGATTCCCGG CCCAGTGTCC CCAGCCCCAA ATCTGCTGGT GCCAGTGGCA 300

GCAAAGATGC TCCTGTCCCT GGTGGTCCTG GCCCTGTGCT CAGTGACCGA ASCTCTGCCT 360

TGCTCTGGAT GAGCCCCAGC TCTGCAACGG GCACATGGGG GGAGCCTCCC GGCGGGTTGA 420 

GAGTGGGGCA TGGGCATACC TGAGCCCCCT GGTGCTGCGT AAGGAGCTGG AGTCGCTGGT 480

AGAGAAGGAG GGCAGTGAGG TGCTGGCGTT GCCTGAACTG CCCTCTGCCC ACCCCATCAT 540

CTTCTGGAAC CTTTTGTGGT ATTTCCAACG GCTACGNCTG CCCAGTATTC TACCAGGCCT 600

GGTGCTGGCC TCCTGTGATG GGCCTTCGMA CTCCCAGGCC CCATCTCCTT GGCTAACCCC 660 
TGATCCAGCC TCTGTTCAGG TACGGCTGCT GTGGGATGTA CTGACCCCTG ACCCCAATAG 720

CTGCCCACCT CTCTATGTGC TCTGGAGGGT CCACAGCCAG ATCCCCCAGC GGGTGGTATG 780 


GCCAGGCCCT GTACCTGCAT CCCTTAGTTT GGCACTGTTG GAGTCAGTGC TGCGCCATGT 840 

TGGACTCAAT GAAGTGCACA AGGCTGTGGG GCTCCTGCTG GAAACTCTAG GGCCCCCicC 900 

CACTGGCCTG CACCTGCAGA GGGGAATCTA CCGTGAGATA TTATTCCTGA CAATGGCTGC 960

TCTGGGCAAG GACCACGTGG ACATAGTGGC CTTCGATAAG AAGTACAAGT CTGCCTTTAA 1020

CAAGCTGGCC AGCAGCATGG GCAAGGAGGA GCTGAGGCAC CGGCGGGCGC AGATGCCCAC 1080 


TCCCAAGGGC ATTGACTGCC GAAAATGTTT TGGAGCACCT CCAGAATGCT AGAGACCTTA 1140 

AGCTTCCCTC TCCAGCCTAG GGTGGGGAAG TGAGGAAGAA GGGATTCTAG AGTTAAACTG 1200 

CTTCGCTGTT GCCTTCATGG AGTTGGGAAC AGGCTGGGAA GGATGCCCAG TCAAAGGCTC 1260

CAAGCGAGGA CAACAGGAAG AGGGATCCAC TGTTACCAAA AGTCCTGATT CCCCCATCAC 1320 

CAACCTACCC AGTTTGTTCG TGCTGATGTT GGGGGAGATG TGGGGGGAGT TGGTACAGCT 1380

CTGTTCTTCC CTTGTCCTAT ACCGGGAACT CCCCTCCAGG GTACCCACAG ATCTGCATTG 1440 

CCCTGGTCAT TTTAGAAGTT TTTGTTTTAA AAAACAACTG GAAAGATGCA GAGCTACTGA 1500

GCCTTTGCCC TGAATGGGAG GTAGGGATGT CATTCTCCAC CAATAATGGT CCCTCTTCCC 1560

TGACGTTGCT GAAGGAGGCC AAGGCTCTCC ATGCCTTTCT ACCTAAGTGT TTGTATTTTA 1620

TTTTAAATTA TTTATTCTGG AGCCAGAGCG CCCTTGCTTA TGAGGTTCTT ATGGAGAGTG 1680 


AGAAAGGGAA GGGAAATAGG GCACCATGGT CCGGTGGTTT GTAGTTCCTT GVAAGTCAGG 1740 

CACTGGGAGC TAGAGGAGTC TCAAGCTCCC CTTAGGAAGA AGTGGTGGCC CGTCCAGTCC 1800

TAATTTTTCT TGCCTOCCCC GCCTTGGGGA ATGCCTCACC CACCCAGGTC CTGACCTGTG 1860

CAATAAGGAT TGTTCCCTGC GAAGTTTTGT TGGATGTAAA TATAGTAAAA GCTGGTTCTG 1920 

TCTTTTTCAA AANAAAAAAA AAAAAAAACT 1950 




(2) INFORMATION FOR SEQ ID NO: 132:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 990 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:

TGGAAGATTT AAAATAGGTT TCATATTTCT CTTGAATATG AATATATAAG CTTGAATAAG 60

ma-iGTccr ta-Tattatg aaattttcct tattatttct accaatgctt cttatattaa

_-.CC CTGATCT TITTC^-TATT AGTATATGTA O.TTAGCTGC CTGTC-CATTA ACATTTCCAT 

GAAArcrArr tttgcattgt TraArcTTAA actttttgtg tctttatata aggtatgcty 

CTTTTAAGCA TGATAT-TTT AAZ GAG AAT A GTTGAAAGAC AATCTYO^CC TTTTACTTGT 
ATATTTAJZAT 3TAA~GTAAT TTTT'GATGCA TATTACGTCT TATTATTTAA CCAACCTATT 
ZTA.CTTTA.rC rAC-3GCAnT 'TTCAGAAAGC CTTATTTTCT TGTATTAATG AAATATTTTT 
---YCA ITGTAT rtrCCVrTAT TAGTTAGKAA TACGXTACYC YAAATATATA TTGTGG3TAT 
TITGAGAA.TT GGAATATGCC TCCTTAATTT ATTAGAGGCT AAC-CTAAATT ATTACTTTTA 
CCACTTACTT GAAAATTCTG GA-GTTT AGA ACATTTATTG TTTTATGCAT TTTAATTCTA 
CrrGTATTTT TACTACTCCT A.-A/GATTATT ATTGTTTTAG ACAAGCCAAA ATATATNTTG 
TTATTATCTT ATYCTCCATT TCTTTCTGTA TTTTTATGCC ACTATGTATG CTCAATTTCC 
TTCTATCTGA TGAACCT.^AT TCA3TACTTT TGTTTTTTAA TCTGTGCAGG TAGCCTGGCC 
ATTAAATTTT 7ATTTTTGGT TTGCTGAAAA AATTGTGTTT ATTtCTATAT GCATACTTAT 
CGA7ATAGAA ZMCTAG7r:;G ACATATTTTT AGTATTTATA AATGTAAAGT CATTWATTKG 

GcrrcTA.rcA rrrc:<— jGA g^aatcaatt gtcagcccaa tagtttttca ttttaaatta 

Oia-A.TTTTt TCAXGTGTGT GG7TTTAGGA 

(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1720 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

GTCTGAGAAG CGACTGGGGT TAXTCCCCTA AAGTTTACTT CAGCACTAAC ACTAGTGCTT 

CCGCTGGAGT TTGCAG7TTT CCkGCTTTAT ACAGGATTTT CCTTTGACTG GAAGAGTCAA 

GGATATAGAG ACrCA^GACT GACATTTATT (7TACAACATC AAGGGGAATA GGATACTCAT 

CAAACTGGGA TTATTCTTAT C-AAACATGG TCTTCTTTGA ATAAGAAAAA TACATAGTTG 

GTTA.TTA_TGG ACTT AA-ACT GTGTTAAATG GATATTCTGA TAAAATATTT GCTGCTCTGT

AGA-GTGTGGA. AAATCTGA.GA ATATTAGCTT TACTCATCTT GAGCTTTGAG GATGTTCTCT

GTACGCCGAT GGTTTCATAT TAACTAAAAA AGCTGGGTAT TGTAAAATCT CATTTATAAA 

.-ACTCAG.ATG AGA-AGAAAAT TTTCTTTGAT GGTGAGACTG TTGTCTTAGT TCAGGAAATT 



